METHODS AND REAGENTS FOR DISCOVERING AND USING
MAMMALIAN MELANOCORTIN RECEPTOR AGONISTS AND 
ANTAGONISTS TO MODULATE FEEDING BEHAVIOR IN ANIMALS 

BACKGROUND OF THE INVENTION 
1. Field of the Invention 

The present invention relates to the cloning, expression and functional 
characterization of mammalian melanocortin receptor genes. The invention provides 
nucleic acid encoding mammalian melanocortin receptors, recombinant expression 
constructs comprising said nucleic acid, and mammalian cells into which said 
recombinant expression constructs have been introduced, and that express functional 
mammalian melanocortin receptors. The invention also provides a panel of such 
transformed mammalian cells expressing melanocortin receptors for screening 
compounds for receptor agonist and antagonist activity. The invention provides methods 
for using such panels of melanocortin receptor-expressing mammalian cells to 
specifically detect and identify agonists and antagonists for each melanocortin receptor, 
as well as patterns of agonist and antagonist activity of said compounds for the class of 
melanocortin receptors. Such screening methods provide a means for identifying
compounds with patterns of melanocortin agonist and antagonist activity that are
associated with the capacity to influence or modify physiological function and behavior,
particularly metabolism and feeding behavior.

2. Background of the Invention

The proopiomelanocortin (POMC) gene product is processed to produce a large
number of biologically active peptides. Two of these peptides, α-melanocyte stimulating
hormone (αMSH) and adrenocorticotropic hormone (ACTH), have well understood roles
in control of melanocyte and adrenocortical function, respectively. Both of these
hormones are also found in a variety of forms with unknown functions, for example,
γ-melanocyte stimulating hormone (γMSH), which has little or no ability to stimulate
pigmentation (Ling et al., 1979, Life Sci. 25: 1773-1780; Slominski et al., 1992, Life Sci.
50: 1103-1108). A melanocortin receptor gene specific for each of the αMSH, ACTH
and γMSH hormones has been discovered by some of the present inventors (see U.S.
Patent Nos. 5,280,112, 5,532,347 and U.S. Application Serial No. 08/044,812,
incorporated by reference herein). In addition, two other melanocortin receptor genes
have been discovered by some of the present inventors (see Lu et al., 1994, Nature 371:
799-802; Mountjoy et al., 1994, Molec. Endocrinol. 8: 1298-1308) and others (see
Gantz et al., 1993, J. Biol. Chem. 268: 15174-15179 and Labbe et al., 1994, Biochem.
33: 4543-4549).

Along with the well-recognized activities of αMSH in melanocytes and ACTH
in adrenal and pituitary glands, the melanocortin peptides also have a diverse array of
biological activities in other tissues, including the brain and immune system, and bind
to specific receptors in these tissues with a distinct pharmacology (see Hanneman et al.,
in Peptide Hormone as Prohormones, G. Martinez, ed. (Ellis Horwood Ltd.: Chichester,
UK) pp. 53-82; DeWied & Jolles, 1982, Physiol. Rev. 62: 976-1059 for reviews). A
complete understanding of these peptides and their diverse biological activities requires 
the isolation and characterization of their corresponding receptors. Some biochemical 
studies have been reported in the prior art. 

Shizume, 1985, Yale J. Biol. Med. 58: 561-570 discusses the physiology of
melanocyte stimulating hormone. 

Tatro & Reichlin, 1987, Endocrinology 121: 1900-1907 disclose that MSH 
receptors are widely distributed in rodent tissues. 

Solca et al., 1989, J. Biol. Chem. 264: 14277-14280 disclose the molecular weight
characterization of mouse and human MSH receptors linked to radioactively and 
photoaffinity labeled MSH analogues. 

Siegrist et al., 1991, J. Receptor Res. 11: 323-331 disclose the quantification of
receptors on mouse melanoma tissue by receptor autoradiography. 

Cone & Mountjoy, U.S. Patent No. 5,532,347 disclose the isolation of human
and mouse α-MSH receptor genes and uses thereof (incorporated herein by reference).

Cone & Mountjoy, U.S. Patent No. 5,280,112 disclose the isolation of human
and bovine ACTH receptor genes.

Mountjoy et al., 1992, Science 257: 1248-1251 disclose the isolation of cDNAs
encoding mammalian ACTH and MSH receptor proteins. 

POMC neurons are present in only two regions of the brain, the arcuate nucleus
of the hypothalamus, and the nucleus of the solitary tract of the brain stem. Neurons
from both sites project to a number of hypothalamic nuclei known to be important in
feeding behavior, including the paraventricular nucleus, lateral hypothalamic area, and
ventromedial hypothalamic nucleus. While previous reports have claimed both
stimulatory and inhibitory effects of α-MSH on feeding behavior (see Shimizu et al.,
1989, Life Sci. 45: 543-552; Tsujii et al., 1989, Brain Res. Bull. 165-169),
knowledge of specific melanocortin receptors, their location within the central nervous
system and the necessary pharmacological tools were not sufficiently developed at that
time to allow the resolution of this issue. The present inventors have shown herein that
a novel antagonist of the MC-3 and MC-4 melanocortin receptors can substantially
increase food consumption in animals engaged in normal or fast-induced feeding
behavior. This is consistent with expression of both MC-3 and MC-4 receptor mRNAs
at these sites in in situ hybridization studies (Roselli-Rehfuss et al., 1993, Proc. Natl.
Acad. Sci. USA 8856-8860; Mountjoy et al., 1994, Molec. Endocrinol. 8:
1298-1308). Moreover, the regulation of arcuate nucleus POMC gene expression is consistent
with an inhibitory role for POMC in feeding behavior. POMC mRNA levels are
decreased following a fast (Bergendahl et al., 1992, Neuroendocrinol. 56: 913-920;
Brady et al., 1990, Neuroendocrinol. 52: 441-447), and a significant diurnal variation
in POMC mRNA levels in the arcuate nucleus is seen in rat, with the nadir occurring
around the onset of nighttime feeding at 1800 hrs (Steiner et al., 1994, FASEB J. 8:
479-488).

Thus, the experimental evidence indicates that POMC neurons play an important
role in tonic inhibition of feeding behavior, wherein obesity results from a chronic
disruption of this inhibitory tone by antagonism of central melanocortin receptors in at
least one animal model (agouti).

These results reveal for the first time a need in the art for a means for
characterizing mammalian melanocortin receptor agonists and antagonists in vitro for
the development of compounds that affect feeding behavior in animals.



SUMMARY OF THE INVENTION 

The present invention provides a biological screening system for identifying and
characterizing compounds that are agonists or antagonists of mammalian melanocortin
receptors. The biological screening system of the invention comprises a panel of
transformed mammalian cells comprising a recombinant expression construct encoding
a mammalian melanocortin receptor, and expressing said receptor thereby. The
invention provides such a panel of transformed mammalian cells wherein the panel 
comprises cells expressing each type of mammalian melanocortin receptor. Thus, the 
invention also provides nucleic acid encoding mammalian melanocortin receptors, 
recombinant expression constructs comprising said nucleic acid, and mammalian cells 
into which said recombinant expression constructs have been introduced, and that 
express functional mammalian melanocortin receptors. Methods for using such panels 
of melanocortin receptor-expressing mammalian cells to specifically detect and identify 
agonists and antagonists for each melanocortin receptor, as well as patterns of agonist 
and antagonist activity of said compounds for the class of melanocortin receptors, are 
also provided. Such screening methods provide a means for identifying compounds with
patterns of melanocortin agonist and antagonist activity that are associated with the
capacity to influence or modify metabolism and behavior in an animal, particularly
feeding behavior.

Thus, the invention provides in a first aspect a biological screening panel for 
determining the melanocortin receptor agonist/antagonist profile of a test compound. 
The panel comprises a first mammalian cell comprising a recombinant expression 
construct encoding a mammalian melanocortin receptor that is the α-MSH (MC-1)
receptor. The panel also comprises a second mammalian cell comprising a recombinant 
expression construct encoding a mammalian melanocortin receptor that is the ACTH 
(MC-2) receptor. The panel also comprises a third mammalian cell comprising a 
recombinant expression construct encoding a mammalian melanocortin receptor that is 
the MC-3 receptor. The panel also comprises a fourth mammalian cell comprising a 
recombinant expression construct encoding a mammalian melanocortin receptor that is 
the MC-4 receptor. The panel also comprises a fifth mammalian cell comprising a
recombinant expression construct encoding a mammalian melanocortin receptor that is
the MC-5 receptor. As provided by the invention, each mammalian cell expresses the
melanocortin receptor encoded by the recombinant expression construct comprising said 
cell. 

In preferred embodiments, the melanocortin receptors encoded by the
recombinant expression constructs comprising the transformed mammalian cells
comprising the panel are mouse MC-1 receptor (SEQ ID Nos.: 3 and 4), human MC-1
receptor (SEQ ID Nos.: 5 and 6), human MC-2 (ACTH) receptor (SEQ ID Nos.: 7 and
8), bovine MC-2 receptor (SEQ ID Nos.: 9 and 10), rat MC-3 receptor (SEQ ID Nos.:
11 and 12), human MC-4 receptor (SEQ ID Nos.: 15 and 16) and mouse MC-5 receptor
(SEQ ID Nos.: 17 and 18).

In a second aspect, the invention provides a method for using the melanocortin 
receptor panel to identify and characterize test compounds as melanocortin receptor 
agonists and/or antagonists. In this embodiment, the method provided by the invention 
identifies a melanocortin receptor agonist, and comprises the steps of contacting each 
of the cells of the panel with a test compound to be characterized as an agonist of a 
mammalian melanocortin receptor and detecting binding of the test compound to each 
of the mammalian melanocortin receptors by assaying for a metabolite produced in the 
cells that bind the compound. In a preferred embodiment, the detected metabolite is 
cAMP. 

In a preferred embodiment of this method, each of the cells of the panel of 
mammalian cells expressing mammalian melanocortin receptors further comprises a 
recombinant expression construct encoding a cyclic AMP responsive element (CRE) 
transcription factor binding site that is operatively linked to a nucleic acid sequence 
encoding a protein capable of producing a detectable metabolite. In preferred
embodiments, said protein is β-galactosidase, most preferably encoded by a nucleic acid
comprising the recombinant expression construct identified as pCRE/β-galactosidase (as
disclosed in Chen et al., 1994, Analyt. Biochem. 226: 349-354). As provided by the
invention, expression of the protein that produces the detectable metabolite is dependent
on binding of the test compound to the melanocortin receptor expressed by each cell in
the panel and the intracellular production of cAMP as a result. In this embodiment,
cAMP production results in expression of a protein capable of producing a detectable
metabolite, the protein most preferably being β-galactosidase. In preferred
embodiments, the detectable metabolite absorbs light to produce a colored product.

Thus, this embodiment of the invention provides a panel of melanocortin receptor- 
expressing cells whereby melanocortin hormone binding results in the production of a 
colored product in proportion to the extent of cAMP production in the cell as a result of 
hormone receptor binding. 
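
By way of illustration only, the colorimetric readout described above might be reduced to an agonist profile across the five-receptor panel as in the following minimal Python sketch. The function name `agonist_profile`, the use of absorbance values, and the normalization to a maximal-stimulation control are assumptions made for this example and are not taken from the disclosure.

```python
from typing import Dict

# The five receptor types that make up the screening panel.
RECEPTORS = ("MC-1", "MC-2", "MC-3", "MC-4", "MC-5")


def agonist_profile(test: Dict[str, float],
                    blank: Dict[str, float],
                    maximal: Dict[str, float]) -> Dict[str, float]:
    """Express the reporter signal for a test compound as a fraction of the
    assay window (maximal control minus blank) for each receptor.

    All three inputs are hypothetical absorbance readings keyed by receptor.
    """
    profile = {}
    for receptor in RECEPTORS:
        window = maximal[receptor] - blank[receptor]
        if window <= 0:
            raise ValueError(f"no assay window for {receptor}")
        profile[receptor] = (test[receptor] - blank[receptor]) / window
    return profile
```

A value near 1 for a given receptor would suggest full agonist activity at that receptor in this hypothetical normalization, and a value near 0 would suggest none.
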

In another embodiment of this aspect of the invention is provided a method for
characterizing a compound as an antagonist of a mammalian melanocortin receptor. In
this embodiment, the method comprises the steps of contacting each of the cells of the
panel with an agonist of the mammalian melanocortin receptor in an amount sufficient
to produce a detectable amount of a metabolite produced in the cells that bind the
agonist, in the presence or absence of a test compound to be characterized as an
antagonist of a mammalian melanocortin receptor, and comparing the amount of the
metabolite produced in each cell in the panel in the presence of the test compound with
the amount of the metabolite produced in each cell in the panel in the absence of the test
compound. As provided by the assay, inhibition of the production of the detectable
metabolite is used as an indication that the tested compound is a melanocortin receptor
antagonist, which is further characterized quantitatively by the extent of said inhibition.
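
The extent of inhibition referred to above could, for example, be expressed as a percentage. The following Python sketch is one hedged way to do so; the function name `percent_inhibition`, its parameters, and the optional blank subtraction are hypothetical and are offered only to make the comparison concrete.

```python
def percent_inhibition(agonist_alone: float,
                       agonist_plus_test: float,
                       blank: float = 0.0) -> float:
    """Percent reduction in metabolite signal caused by the test compound.

    agonist_alone     -- signal with agonist in the absence of the test compound
    agonist_plus_test -- signal with agonist in the presence of the test compound
    blank             -- background signal to subtract (assumed; defaults to 0)
    """
    stimulated = agonist_alone - blank
    if stimulated <= 0:
        raise ValueError("agonist alone must give a signal above blank")
    return 100.0 * (1.0 - (agonist_plus_test - blank) / stimulated)
```

Under this convention, 0 indicates no antagonism and 100 indicates complete suppression of the agonist-induced signal.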

In a preferred embodiment of this method, each of the cells of the panel of 
mammalian cells expressing mammalian melanocortin receptors further comprises a 
recombinant expression construct encoding a cyclic AMP responsive element (CRE) 
transcription factor binding site that is operatively linked to a nucleic acid sequence 
encoding a protein capable of producing a detectable metabolite. In preferred 
embodiments, said protein is β-galactosidase, most preferably encoded by a nucleic acid
comprising the recombinant expression construct identified as pCRE/β-galactosidase.
As provided by the invention, expression of the protein that produces the detectable
metabolite is dependent on binding of the test compound to the melanocortin receptor
expressed by each cell in the panel. In preferred embodiments, the detectable metabolite
absorbs light to produce a colored product. Thus, this embodiment of the invention
provides a panel of melanocortin receptor-expressing cells whereby melanocortin
hormone binding results in the production of a colored product in proportion to the
extent of cAMP production in the cell as a result of hormone receptor binding.

The invention also provides melanocortin receptor agonists identified by the 
methods and using the screening panel of the invention. In preferred embodiments, the 
agonist is an agonist of the MC-3 mammalian melanocortin receptor. In other preferred 
embodiments, the agonist is an agonist of the MC-4 mammalian melanocortin receptor. 

The invention provides melanocortin receptor antagonists identified by the 
methods and using the screening panel of the invention. In preferred embodiments, the 
antagonist is an antagonist of the MC-3 mammalian melanocortin receptor. In other
preferred embodiments, the antagonist is an antagonist of the MC-4 mammalian
melanocortin receptor.

The invention also provides methods for characterizing mammalian melanocortin
receptor agonists for the capacity to modify or influence metabolism and feeding 
behavior in an animal. In a first aspect, the invention provides a method for 
characterizing melanocortin receptor MC-3 or MC-4 agonists as inhibitors of feeding 
behavior in an animal, the method comprising the steps of providing food to an animal 
that has been deprived of food for at least 12 hours, with or without administering to the 
animal an MC-3 or MC-4 receptor agonist of the invention, and comparing the amount 
of food eaten by the animal after administration of the MC-3 or MC-4 receptor agonist 
with the amount of food eaten by the animal without administration of the MC-3 or
MC-4 receptor agonist.

In another aspect, the invention provides a method for characterizing a 
melanocortin MC-3 or MC-4 receptor antagonist as a stimulator of feeding behavior in 
an animal. In this embodiment, the method comprises the steps of providing food to an 
animal not deprived of food for at least 12 hours, with or without administering to the 
animal an MC-3 or MC-4 receptor antagonist, immediately prior to the onset of darkness 
or nighttime, and comparing the amount of food eaten by the animal after administration 
of the MC-3 or MC-4 receptor antagonist with the amount of food eaten by the animal 
without administration of the MC-3 or MC-4 receptor antagonist. 
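
The comparison of food intake with and without administration of a compound could be summarized as in the following Python sketch. It assumes, purely for illustration, that intake is recorded in grams per animal and that group means are compared; the function name `intake_change` is hypothetical.

```python
from statistics import mean
from typing import Sequence


def intake_change(control_g: Sequence[float],
                  treated_g: Sequence[float]) -> float:
    """Percent change in mean food intake of treated animals relative to
    control animals that did not receive the compound."""
    baseline = mean(control_g)
    if baseline <= 0:
        raise ValueError("control intake must be positive")
    return 100.0 * (mean(treated_g) - baseline) / baseline
```

In this sketch a positive result would be consistent with stimulation of feeding, as expected for an MC-3 or MC-4 receptor antagonist, and a negative result with inhibition of feeding, as expected for an agonist.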

Thus, the invention also provides methods for using certain of the melanocortin
receptor agonists and antagonists for modifying feeding behavior in an animal. In a first
aspect, the invention provides a method for stimulating feeding in an animal, the method
comprising administering to the animal an MC-3 or MC-4 receptor antagonist. In a
preferred embodiment, the antagonists are administered systemically. In additional
embodiments, the antagonists are administered intracerebroventricularly.

In another aspect, the invention provides a method for inhibiting feeding in an
animal, the method comprising administering to the animal an MC-3 or MC-4 receptor
agonist. In a preferred embodiment, the agonists are administered systemically. In
additional embodiments, the agonists are administered intracerebroventricularly.

In yet another aspect, the invention provides mammalian melanocortin receptor
agonists having the general formula:

A-B-C-D-E-F-G-amide

wherein A is an aliphatic amino acid residue, including for example Leu, Ile, Nle and
Met, as well as analogues and substituted derivatives thereof; B is an acidic amino acid
residue, including for example Asp and Glu; C is a basic amino acid residue, such as
His; D is an aromatic amino acid residue having a D- conformation, including D-Phe,
D-Tyr and substituted derivatives thereof; E is a basic amino acid residue, for example Arg,
Lys, homoArg, homoLys, and analogues or substituted derivatives thereof; F is Trp or
substituted derivatives thereof; and G is Lys, homoLys or a substituted derivative
thereof. In the peptide embodiments of the melanocortin receptor agonists of the
invention, the peptide is cyclized by the formation of an amide bond between the side
chain carboxyl group of the Asp or Glu residue at position B in the peptide, and the side
chain amino group of the Lys or homoLys residue at position G. In preferred
embodiments, the melanocortin receptor agonists of the invention are agonists of the
MC-3 or MC-4 receptor.

The invention also provides mammalian melanocortin receptor antagonists
having the general formula:

A-B-C-D-E-F-G-amide

wherein A is an aliphatic amino acid residue, including for example Leu, Ile, Nle and
Met, as well as analogues and substituted derivatives thereof; B is an acidic amino acid
residue, including for example Asp and Glu; C is a basic amino acid residue, such as
His; D is an aromatic amino acid residue having a D- conformation, including D-Nal and
substituted derivatives thereof; E is a basic amino acid residue, for example Arg, Lys,
homoArg, homoLys, and analogues or substituted derivatives thereof; F is Trp or
substituted derivatives thereof; and G is Lys, homoLys or a substituted derivative
thereof. In the peptide embodiments of the melanocortin receptor antagonists of the
invention, the peptide is cyclized by the formation of an amide bond between the side
chain carboxyl group of the Asp or Glu residue at position B in the peptide, and the side
chain amino group of the Lys or homoLys residue at position G. In preferred
embodiments, the melanocortin receptor antagonists of the invention are antagonists of the
MC-3 or MC-4 receptor.
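
To make the two general formulae easier to compare, the following Python sketch checks a seven-residue sequence against the positional definitions given above. It is a simplified illustration under stated assumptions: only the residues named by way of example in the text are listed, analogues and substituted derivatives are not modeled, and the names `TEMPLATE_COMMON`, `POSITION_D` and `classify` are hypothetical.

```python
from typing import Optional, Sequence

# Residues named in the text for the positions shared by both formulae.
TEMPLATE_COMMON = {
    "A": {"Leu", "Ile", "Nle", "Met"},           # aliphatic
    "B": {"Asp", "Glu"},                         # acidic
    "C": {"His"},                                # basic
    "E": {"Arg", "Lys", "homoArg", "homoLys"},   # basic
    "F": {"Trp"},
    "G": {"Lys", "homoLys"},
}

# Position D is where the agonist and antagonist formulae differ.
POSITION_D = {
    "agonist": {"D-Phe", "D-Tyr"},
    "antagonist": {"D-Nal"},
}


def classify(residues: Sequence[str]) -> Optional[str]:
    """Return 'agonist' or 'antagonist' if the sequence fits the matching
    general formula A-B-C-D-E-F-G, otherwise None."""
    if len(residues) != 7:
        return None
    a, b, c, d, e, f, g = residues
    for position, residue in zip("ABCEFG", (a, b, c, e, f, g)):
        if residue not in TEMPLATE_COMMON[position]:
            return None
    for kind, allowed in POSITION_D.items():
        if d in allowed:
            return kind
    return None
```

For example, `classify(["Nle", "Asp", "His", "D-Phe", "Arg", "Trp", "Lys"])` fits the agonist formula, while substituting D-Nal at position D fits the antagonist formula. The check concerns sequence only and says nothing about cyclization or measured receptor activity.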

It is an advantage of the present invention that it provides an in vitro screening
method for characterizing compounds having melanocortin receptor activities that relate 
to feeding behavior in animals. Specifically, the invention advantageously provides 
means and methods for identifying compounds having melanocortin receptor agonist 
and/or antagonist activity that have been associated with either stimulating or inhibiting 
feeding behavior when administered to an animal. The invention thus provides an 
economical first step in screening compounds for the capacity to affect feeding behavior, 
including synthetic, peptidomimetic or organomimetic derivatives of melanocortin 
receptor agonists or antagonists as disclosed herein or elsewhere. 

Specific preferred embodiments of the present invention will become evident 
from the following more detailed description of certain preferred embodiments and the 
claims. 



DESCRIPTION OF THE DRAWINGS 

Figures 1A and 1B illustrate the nucleotide (SEQ ID No.: 3) and amino acid
(SEQ ID No.: 4) sequence of the mouse melanocyte stimulating hormone receptor gene.

Figures 2A and 2B illustrate the nucleotide (SEQ ID No.: 5) and amino acid
(SEQ ID No.: 6) sequence of the human melanocyte stimulating hormone receptor gene.

Figures 3A and 3B illustrate the nucleotide (SEQ ID No.: 7) and amino acid
(SEQ ID No.: 8) sequence of the human adrenocorticotropic stimulating hormone
receptor gene.

Figures 4A and 4B illustrate the nucleotide (SEQ ID No.: 9) and amino acid
(SEQ ID No.: 10) sequence of the bovine adrenocorticotropic stimulating hormone
receptor gene.

Figures 5A and 5B illustrate the nucleotide (SEQ ID No.: 11) and amino acid
(SEQ ID No.: 12) sequence of the rat melanocortin-3 receptor gene.

Figures 6A and 6B illustrate the nucleotide (SEQ ID No.: 15) and amino acid
(SEQ ID No.: 16) sequence of the human melanocortin-4 receptor gene.

Figures 7A and 7B illustrate the nucleotide (SEQ ID No.: 17) and amino acid
(SEQ ID No.: 18) sequence of the mouse melanocortin-5 receptor gene.

Figure 8 shows a graph of intracellular cAMP accumulation resulting from
melanocyte stimulating hormone receptor agonist binding in human 293 cells transfected
with a MSH receptor-encoding recombinant expression construct, wherein represents
binding of NDP-MSH, -o- represents binding of ACTH and -A- represents binding of
αMSH.

Figure 9 illustrates the cAMP response of mouse Y1 cells to binding of
melanocortin peptides to human melanocortin-2 (ACTH) receptor, as measured by the
β-galactosidase assay described in Example 4, wherein -■- represents binding to
wild-type ACTH-R and -A- represents binding to an ACTH-R variant.

Figure 10 illustrates the results of competition binding experiments of
melanocortin peptides to cells expressing a recombinant expression construct encoding
the rat melanocortin-3 receptor, wherein -■- represents binding of NDP-MSH, -A-
represents binding of γMSH, represents binding of αMSH, -o- represents binding
of ACTH4-10 and -□- represents binding of ORG2766.

Figures 11A through 11C illustrate the results of experiments showing
intracellular cAMP accumulation caused by receptor-ligand binding in human 293 cells
expressing the MC-3 receptor. In Figure 11A, represents binding of αMSH, -■-
represents binding of γ2-MSH, -A- represents binding of des-acetyl αMSH and
represents binding of ACTH1-39. In Figure 11B, -•- represents binding of γ1-MSH,
represents binding of γ2-MSH and represents binding of des-acetyl ^ -MSH. In
Figure 11C, represents binding of ACTH4-10, -■- represents binding of NDP-MSH
and -A- represents binding of ORG2766.

Figure 12 shows a graph of intracellular cAMP accumulation resulting from
peptide binding to human melanocortin-4 receptor agonist in human 293 cells
transfected with a MC-4 receptor-encoding recombinant expression construct, wherein
-□- represents binding of ACTH4-10, represents binding of ACTH1-39, represents
binding of NDP-MSH, -o- represents binding of αMSH, -A- represents binding of
γ2-MSH, and -A- represents binding of des-acetyl αMSH.

Figure 13 illustrates the results of cAMP accumulation and cAMP-dependent
β-galactosidase assays of melanocortin peptide binding to a rat melanocortin-5 receptor,
wherein -□- represents binding of αMSH, -A- represents binding of β-MSH, and -o-
represents binding of γ-MSH, each determined using the β-gal method, and wherein -■-
represents binding of αMSH, -A- represents binding of β-MSH, and -•- represents
binding of γ-MSH, each determined using the cAMP method.

WO 98/10068

PCT/US97/15565

Figure 14 illustrates the structure of the pCRE/β-gal plasmid.

Figure 15 illustrates the results of the β-galactosidase-coupled, colorimetric
melanocortin receptor binding assay using cells expressing each of the MC-1, MC-3,
MC-4 or MC-5 receptors and contacted with αMSH or a variety of αMSH analogues,
wherein -■- represents binding of αMSH, -A- represents binding of NDP-MSH, -•-
represents binding of SHU9128 (para-F substituted), -□- represents binding of
SHU9203 (p-Cl substituted), -A- represents binding of SHU8914 (p-I substituted), and
-o- represents binding of SHU9119.

Figures 16A through 16D show the results of the β-galactosidase-coupled,
colorimetric melanocortin receptor binding assay to determine antagonist activity of
melanocortin analogues SHU9119 and SHU8914 in cells expressing each of the
melanocortin receptors MC-3 and MC-4. In Figure 16A, -■- represents binding of
αMSH, -□- represents binding of 100 nM SHU9119, -A- represents binding of 10 nM
SHU9119, and -o- represents binding of 1 nM SHU9119. In Figure 16B, -■- represents
binding of αMSH, -□- represents binding of 100 nM SHU9119, -A- represents binding
of 50 nM SHU9119, and -o- represents binding of 10 nM SHU9119. In Figure 16C, -■-
represents binding of αMSH, -□- represents binding of 1000 nM SHU8914, -A-
represents binding of 100 nM SHU8914, and -o- represents binding of 10 nM SHU8914.
In Figure 16D, -■- represents binding of αMSH, -□- represents binding of 100 nM
SHU8914, -A- represents binding of 50 nM SHU8914, and -o- represents binding of
10 nM SHU8914.

Figure 17 shows the results of classic competition binding assays using the
melanocortin analogues SHU9119 and SHU8914 at the MC3-R and MC4-R receptors,
wherein -■- represents binding of NDP-MSH, -A- represents binding of SHU8914 (p-I
substituted), and -o- represents binding of SHU9119.

Figures 18A and 18B show the results of cAMP accumulation experiments
(performed using the β-galactosidase assay of Example 4) for rat MC-3 receptor (Figure
18A) and for mouse MC-4 receptor (Figure 18B). In Figure 18A, -■- represents
binding of NDP-MSH, -A- represents binding of MTII and represents binding of
forskolin. In Figure 18B, -■- represents binding of MTII, -A- represents binding of 
NDP-MSH and -▼- represents binding of forskolin. 

Figures 19A through 19C show the effect on food intake of
intracerebroventricular administration of melanocortin analogue SHU9119 in mice. In
Figure 19A, represents administration of acsf (n=7) and represents
administration of 6 nmol of SHU9119 (n=6). In Figure 19B, -■- represents
administration of acsf (n=6) and represents administration of 6 nmol of SHU9119
(n^tf). In Figure 19C, -o- represents administration of acsf (n=11) and represents
administration of 6 nmol of SHU9119 (n=12).

Figures 20A through 20C show the effect on food intake of
intracerebroventricular administration of melanocortin analogue MTII in mice. In
Figure 20A, represents administration of acsf (n=8), -V- represents administration
of 0.1 nmol MTII (n=8), represents administration of 1 nmol MTII (n=7) and -A-
represents administration of 3 nmol MTII (n=9). In Figure 20B, represents
administration of acsf (n=12), represents administration of 3 nmol MTII and 6 nmol
SHU9119 (n=9) and -A- represents administration of 3 nmol MTII (n=9).

Figure 20D shows the effect on locomotor activity of intracerebroventricular 
administration of melanocortin analogue MTII in mice, wherein represents 
administration of vehicle alone (n=6) and -A- represents administration of 3nmol MTII 
(n=6). 

Figures 21A through 21D show the effect on food intake of
intracerebroventricular administration of melanocortin analogue MTII in mice. In
Figure 21A, represents administration of acsf (n=6) and -A- represents
administration of 3 nmol MTII (n=7). In Figure 21B, open bars represent administration
of acsf (n=6), solid bars represent administration of 1.18 nmol neuropeptide Y (NPY;
n=6) and stippled bars represent administration of 3 nmol MTII and 1.18 nmol NPY
(n=6). In Figure 21C, -•- represents administration of acsf (n=7) and -A- represents
administration of 3 nmol MTII (n=7). In Figure 21D, -■- represents administration of
100 nmol MTII (n=6) and -A- represents administration of vehicle alone (n=6).

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

The term "melanocortin receptor" as used herein refers to proteins having the
biological activity of any of the disclosed melanocortin receptors, including the MC-1
(SEQ ID Nos.: 3, 4, 5 and 6), MC-2 (ACTH; SEQ ID Nos.: 7, 8, 9 and 10), MC-3 (SEQ
ID Nos.: 11 and 12), MC-4 (SEQ ID Nos.: 15 and 16) or MC-5 (SEQ ID Nos.: 17 and
18) receptors, as well as naturally-occurring and genetically-engineered allelic variations
in these sequences.

Cloned nucleic acid provided by the present invention may encode MC receptor 
protein of any species of origin, including, for example, mouse, rat, rabbit, cat, and 
human, but preferably the nucleic acid provided by the invention encodes MC receptors 
of mammalian, most preferably rodent and human, origin. 

The production of proteins such as the MC receptors from cloned genes by 
genetic engineering means is well known in this art. The discussion which follows is 
accordingly intended as an overview of this field, and is not intended to reflect the full 
state of the art. 

DNA which encodes MC receptors may be obtained, in view of the instant 
disclosure, by chemical synthesis, by screening reverse transcripts of mRNA from 
appropriate cells or cell line cultures, by screening genomic libraries from appropriate 
cells, or by combinations of these procedures, as illustrated below. Screening of mRNA 
or genomic DNA may be carried out with oligonucleotide probes generated from the MC 
receptor gene sequence information provided herein. Probes may be labeled with a 
detectable group such as a fluorescent group, a radioactive atom or a chemiluminescent 
group in accordance with known procedures and used in conventional hybridization
assays, as described in greater detail in the Examples below. In the alternative, MC 
receptor gene sequences may be obtained by use of the polymerase chain reaction (PCR) 
procedure, with the PCR oligonucleotide primers being produced from the MC receptor 
gene sequences provided herein. See U.S. Patent Nos. 4,683,195 to Mullis et al. and
4,683,202 to Mullis. 
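
As a hedged illustration of how PCR oligonucleotide primers might be derived from a receptor gene sequence, the following Python sketch takes the first bases of a coding strand as the forward primer and the reverse complement of the last bases as the reverse primer. The function names, the default primer length of 20, and the toy sequence in the usage note are assumptions for this example; practical primer design would also weigh melting temperature, GC content and specificity, which are not modeled here.

```python
from typing import Tuple

# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def primer_pair(coding_seq: str, length: int = 20) -> Tuple[str, str]:
    """Return (forward, reverse) primers flanking the given coding sequence.

    The forward primer matches the 5' end of the coding strand; the reverse
    primer is the reverse complement of its 3' end.
    """
    seq = coding_seq.upper()
    if len(seq) < 2 * length:
        raise ValueError("sequence too short for non-overlapping primers")
    forward = seq[:length]
    reverse = reverse_complement(seq[-length:])
    return forward, reverse
```

With the made-up 18-base sequence `"ATGGCCAAATTTGACTGA"` and `length=6`, the sketch yields the forward primer `ATGGCC` and the reverse primer `TCAGTC`.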

MC receptor proteins may be synthesized in host cells transformed with a 
recombinant expression construct comprising a nucleic acid encoding each of the 
receptors disclosed herein. Such a recombinant expression construct can also be 
comprised of a vector that is a replicable DNA construct. Vectors are used herein
to amplify DNA encoding an MC receptor and/or to express DNA which encodes an MC
receptor. For the purposes of this invention, a recombinant expression construct is a
replicable DNA construct in which a DNA sequence encoding an MC receptor is
operably linked to suitable control sequences capable of effecting the expression of the
receptor in a suitable host cell. The need for such control sequences will vary depending
upon the host selected and the transformation method chosen. Generally, control
sequences include a transcriptional promoter, an optional operator sequence to control
transcription, a sequence encoding suitable mRNA ribosomal binding sites, and
sequences which control the termination of transcription and translation. Amplification
vectors do not require expression control domains. All that is needed is the ability to
replicate in a host, usually conferred by an origin of replication, and a selection gene to
facilitate recognition of transformants. See Sambrook et al., 1990, Molecular Cloning:
A Laboratory Manual (Cold Spring Harbor Press: New York).

Also specifically provided by the invention are reporter expression constructs 
comprising a nucleic acid encoding a protein capable of expressing a detectable 
phenotype, such as the production of a detectable reporter molecule, in a cell expressing 
the construct. Such constructs can be used for producing recombinant mammalian cell 
lines in which the reporter construct is stably expressed. Most preferably, however, the 
reporter construct is provided and used to induce transient expression over an 
experimental period of from about 18 to 96 hrs in which detection of the reporter 
protein-produced detectable metabolite comprises an assay. Such reporter expression 
constructs are also provided wherein induction of expression of the reporter construct 
is controlled by a responsive element operatively linked to the coding sequence of the 
reporter protein, so that expression is induced only upon proper stimulation of the 
responsive element. Exemplary of such a responsive element is a cAMP responsive
element (CRE), which induces expression of the reporter protein as a result of an
increase in intracellular cAMP concentration. In the context of the present invention, 
such a stimulus is associated with melanocortin receptor binding, so that a reporter 
construct comprising one or more CREs is induced to express the reporter protein upon 
binding of a receptor agonist to a MC receptor in a recombinantly transformed 
mammalian cell. Production and use of such a reporter construct is illustrated below in 
Example 5. 

Vectors useful for practicing the present invention include plasmids, viruses
(including phage), retroviruses, and integratable DNA fragments (i.e., fragments
integratable into the host genome by homologous recombination). The vector replicates 
and functions independently of the host genome, or may, in some instances, integrate 
into the genome itself. Suitable vectors will contain replicon and control sequences 
which are derived from species compatible with the intended expression host. A 
preferred vector is the plasmid pcDNAI/neo. Transformed host cells are cells which
have been transformed or transfected with recombinant expression constructs made
using recombinant DNA techniques and comprising mammalian MC receptor-encoding
sequences. Preferred host cells are human 293 cells. Preferred host cells for the MC-2
(ACTH) receptor are Y1 cells (subclone OS3 or Y6). Transformed host cells are chosen
that ordinarily express functional MC receptor protein introduced using the recombinant 
expression construct. When expressed, the mammalian MC receptor protein will 
typically be located in the host cell membrane. See, Sambrook et al., ibid. 

Cultures of cells derived from multicellular organisms are a desirable host for 
recombinant MC receptor protein synthesis. In principle, any higher eukaryotic cell
culture is workable, whether from vertebrate or invertebrate culture. However,
mammalian cells are preferred, as illustrated in the Examples. Propagation of such cells
in cell culture has become a routine procedure. See Tissue Culture, Academic Press,
Kruse & Patterson, editors (1973). Examples of useful host cell lines are human 293
cells, VERO and HeLa cells, Chinese hamster ovary (CHO) cell lines, mouse Y1
(subclone OS3), and WI138, BHK, COS-7, CV, and MDCK cell lines. Human 293
cells are preferred.

Cells expressing mammalian MC receptor proteins made from cloned genes in 
accordance with the present invention may be used for screening agonist and antagonist 
compounds for MC receptor activity. Competitive binding assays are well known in the 
art and are described [...]

of MC receptor agonist and antagonist compounds, as detected in receptor binding 
assays as described below. 

One particular use of such screening assays is for developing drugs and other
compounds useful in modifying or changing feeding behavior in mammals. The
invention provides an assay system comprising a panel of recombinant mammalian
cells heterologously expressing each of the MC receptors disclosed herein, wherein the
panel is constructed of at least one cell line expressing an MC receptor, and most
preferably comprising cells expressing each of the MC receptors. The invention
provides such panels also comprising a detection means for detecting receptor agonist
or antagonist binding, such as the reporter expression constructs described herein, using
direct binding and competition binding assays as described in the Examples below. In
the use of this panel, each MC receptor is assayed for agonist or antagonist patterns of
binding a test compound, and a characteristic pattern of binding for all MC receptors is
thereby determined for each test compound. This pattern is then compared with the
patterns of known MC receptor agonists and antagonists to identify new compounds having
a pattern of receptor binding activity associated with a particular behavioral or
physiological effect.
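
The comparison of a test compound's binding pattern against known compounds can be sketched in Python. This is only an illustrative sketch: the labels "agonist" and "antagonist", the dictionary layout and the helper name `closest_reference` are assumptions introduced for the example and are not part of the disclosure. The two reference entries reflect only the MC-3 and MC-4 classifications reported for SHU9119 and MTII in Example 4.

```python
# Activity calls for reference compounds at each receptor in the panel.
# Only MC-3 and MC-4 are listed, following Example 4; a real panel would
# carry one entry per receptor assayed (assumed data layout).
REFERENCE_PROFILES = {
    "SHU9119": {"MC-3": "antagonist", "MC-4": "antagonist"},
    "MTII": {"MC-3": "agonist", "MC-4": "agonist"},
}


def closest_reference(test_profile, references=REFERENCE_PROFILES):
    """Return (name, shared_calls) for the reference sharing the most receptor calls."""
    def score(ref):
        return sum(1 for rec, call in ref.items() if test_profile.get(rec) == call)

    best = max(references, key=lambda name: score(references[name]))
    return best, score(references[best])


# Hypothetical test compound that antagonizes both receptors.
test_compound = {"MC-3": "antagonist", "MC-4": "antagonist"}
print(closest_reference(test_compound))
```

A simple count of matching calls is used here for clarity; a scoring scheme that weights potency at each receptor would be an equally reasonable choice.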

For example, provided herein is experimental evidence that MC-3 or MC-4
receptor antagonists are capable of stimulating feeding in hungry animals, and that MC-3
or MC-4 agonists are capable of inhibiting feeding in animals otherwise stimulated to
eat. The invention provides an in vitro assay to characterize MC-3 and MC-4
agonists/antagonists as a preliminary and economical step towards developing feeding
behavior-modulating drugs for use in vivo.

These results on feeding behavior in vivo have been obtained with certain MC
receptor binding analogues, SHU9119 and MTII. These compounds have the following
chemical structure:

[Chemical structure drawing: SHU-9119]

Generally, those skilled in the art will recognize that peptides as described
herein may be modified by a variety of chemical techniques to produce compounds
having essentially the same activity as the unmodified peptide, and optionally having
other desirable properties. For example, carboxylic acid groups of the peptide,
whether carboxyl-terminal or sidechain, may be provided in the form of a salt of a
pharmaceutically-acceptable cation or esterified to form a C1-C16 ester, or converted
to an amide of formula NR1R2 wherein R1 and R2 are each independently H or C1-C16
alkyl, or combined to form a heterocyclic ring, such as a 5- or 6-membered ring. Amino
groups of the peptide, whether amino-terminal or sidechain, may be in the form of a
pharmaceutically-acceptable acid addition salt, such as the HCl, HBr, acetic, benzoic,
toluene sulfonic, maleic, tartaric and other organic salts, or may be modified to C1-C16
alkyl or dialkyl amino or further converted to an amide. Hydroxyl groups of the
peptide sidechain may be converted to C1-C16 alkoxy or to a C1-C16 ester using
well-recognized techniques. Phenyl and phenolic rings of the peptide sidechain may be
substituted with one or more halogen atoms, such as fluorine, chlorine, bromine or
iodine, or with C1-C16 alkyl, C1-C16 alkoxy, carboxylic acids and esters thereof, or
amides of such carboxylic acids. Methylene groups of the peptide sidechains can be
extended to homologous C2-C4 alkylenes. Thiols can be protected with any one of a
number of well-recognized protecting groups, such as acetamide groups. Those skilled
in the art will also recognize methods for introducing cyclic structures into the peptides
of this invention to select and provide conformational constraints to the structure that
result in enhanced binding and/or stability. For example, a carboxyl-terminal or
amino-terminal cysteine residue can be added to the peptide, so that when oxidized the
peptide will contain a disulfide bond, thereby generating a cyclic peptide. Other
peptide cyclizing methods include the formation of thioethers and carboxyl- and
amino-terminal amides and esters.

Peptidomimetic and organomimetic embodiments are also hereby explicitly 
declared to be within the scope of the present invention, whereby the three- 
dimensional arrangement of the chemical constituents of such peptido- and 
organomimetics mimic the three-dimensional arrangement of the peptide backbone and 
component amino acid sidechains in the peptide, resulting in such peptido- and 
organomimetics of the peptides of this invention having substantial biological activity. 
It is implied that a pharmacophore exists for the receptor agonist and antagonist 
properties of these and related MC receptor binding analogues. A pharmacophore is 
an idealized, three-dimensional definition of the structural requirements for biological 
activity. Peptido- and organomimetics can be designed to fit each pharmacophore with 
current computer modeling software (computer aided drug design). MC receptor 
binding analogues derived using such software and comprising peptido- and 
organomimetics of SHU9119 and MTII and related analogues are within the scope of
the claimed invention. 

The MC receptor binding analogues, in particular those analogues that are MC-3
or MC-4 receptor agonists or antagonists, are provided to be used in methods of
influencing, modifying or changing feeding behavior in mammals in vivo. Specific
examples of uses for the MC receptor binding analogues of the invention include but are 
not limited to treatment of eating disorders such as anorexia and obesity, and other 
pathological weight and eating-related disorders. Other examples are failure to thrive 
disorders and disease-related cachexia, such as occurs in cancer patients. Also within
the scope of the invention is use of the analogues for enhancing appearance or athletic
ability, or as an adjuvant to other therapies to treat disorders such as high blood
pressure, high serum cholesterol, vascular and heart disease, stroke, kidney disease,
diabetes and other metabolic disorders.

The Examples which follow are illustrative of specific embodiments of the
invention, and various uses thereof. They are set forth for explanatory purposes only, and
are not to be taken as limiting the invention.



EXAMPLE 1 
Isolation of an αMSH Receptor Probe by Random
PCR Amplification of Human Melanoma cDNA Using
Degenerate Oligonucleotide Primers

In order to clone novel G-protein coupled receptors, cDNA prepared from RNA 
from human melanoma cells was used as template for a polymerase chain reaction 
(PCR)-based random cloning experiment. PCR was performed using a pair of 
degenerate oligonucleotide primers corresponding to the putative third and sixth 
transmembrane regions of G-protein coupled receptors (Libert et al., 1989, Science 244:
569-72; Zhou et al., 1990, Nature 347: 76-80). The PCR products obtained in this
experiment were characterized by nucleotide sequencing. Two novel sequences 
representing novel G-protein-coupled receptors were identified. 

PCR amplification was performed as follows. Total RNA was isolated from a 
human melanoma tumor sample by the guanidinium thiocyanate method (Chirgwin et
al., 1979, Biochemistry 18: 5294-5299). Double-stranded cDNA was synthesized from
total RNA with murine reverse transcriptase (BRL, Gaithersburg, MD) by oligo-dT
priming (Sambrook et al., ibid.). The melanoma cDNA mixture was then subjected to
45 cycles of PCR amplification using 500 picomoles of degenerate oligonucleotide 
primers having the following sequence: 
Primer III (sense):

GAGTCGACCTGTG(C/T)G(C/T)(C/G)AT(C/T)(A/G)CIIT(G/T)GAC(C/A)G(C/G)TAC
(SEQ ID NO: 1)

and

Primer VI (antisense):

CAGAATTCAG(T/A)AGGGCAICCAGCAGAI(G/C)(G/A)(T/C)GAA
(SEQ ID NO: 2)

in 100 μl of a solution containing 50 mM Tris-HCl (pH 8.3), 2.5 mM MgCl2, 0.01%
gelatin, 200 μM each dNTP, and 2.5 units of Taq polymerase (Saiki et al., 1988,
Science 239: 487-491). These primers were commercially synthesized by Research
Genetics Inc. (Huntsville, AL). Each PCR amplification cycle consisted of incubations
at 94°C for 1 min (denaturation), 45°C for 2 min (annealing), and 72°C for 2 min
(extension).
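
The size of a degenerate primer pool follows directly from the notation used above, and can be checked with a short Python sketch. The parsing convention is an assumption made for illustration: a parenthesized group such as (G/C) is treated as one position with alternative bases, and inosine (I) is counted as a single base. The function names are hypothetical. The sketch also looks for the EcoRI recognition sequence GAATTC, which sits near the 5' end of Primer VI.

```python
import re

# Primer VI (antisense) as given in the text; "I" denotes inosine.
PRIMER_VI = "CAGAATTCAG(T/A)AGGGCAICCAGCAGAI(G/C)(G/A)(T/C)GAA"


def parse_positions(primer):
    """Split a primer into positions; '(A/G)' becomes the list ['A', 'G']."""
    tokens = re.findall(r"\(([A-Z/]+)\)|([A-Z])", primer)
    return [alt.split("/") if alt else [single] for alt, single in tokens]


def degeneracy(primer):
    """Number of distinct sequences in the pool (inosine counted as one base)."""
    total = 1
    for choices in parse_positions(primer):
        total *= len(choices)
    return total


print(len(parse_positions(PRIMER_VI)))  # 33 nucleotide positions
print(degeneracy(PRIMER_VI))            # 16 sequence variants
print("GAATTC" in PRIMER_VI)            # True: EcoRI site present
```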

Amplified products of the PCR reaction were extracted with phenol/chloroform 
and precipitated with ethanol. After digestion with EcoRI and SalI, the PCR products
were separated on a 1.2% agarose gel. A slice of this gel, corresponding to PCR
products of 300 basepairs (bp) in size, was cut out and purified using glass beads and
sodium iodide, and the insert was then cloned into a pBKS cloning vector (Stratagene,
La Jolla, CA).

A total of 172 of such pBKS clones containing inserts were sequenced using
Sequenase (U.S. Biochemical Corp., Cleveland, OH) by the dideoxynucleotide chain
termination method (Sanger et al., 1977, Proc. Natl. Acad. Sci. USA 74: 5463-5467).
Two types of sequences homologous to other G-protein coupled receptors were 
identified. 

EXAMPLE 2A 
Isolation of a Mouse αMSH (MC-1) Receptor cDNA
Probes isolated in Example 1 were used to screen a Cloudman melanoma cDNA
library in order to isolate a full-length cDNA corresponding to the cloned probe. One
clone was isolated from a library of 5 x 10⁶ clones screened as described below in
Example 2B. This clone contained an insert of 2.6 kilobases (kb). The nucleotide
sequence of the complete coding region was determined (see co-owned U.S. Patent No.
5,532,347, incorporated by reference); a portion of this cDNA comprising the coding
region was sequenced and is shown in Figures 1A and 1B (SEQ ID Nos: 3 & 4).

EXAMPLE 2B 
Isolation of a Human αMSH (MC-1) Receptor cDNA
In order to isolate a human counterpart of the murine melanocyte αMSH
receptor gene disclosed in Example 2A and co-owned U.S. Patent No. 5,532,347, a
human genomic library was screened at high stringency (50% formamide, 42°C) using
the human PCR fragments isolated as described in Example 1. A genomic clone was
determined to encode a human MSH receptor (SEQ ID NO: 5). The human MSH
receptor has a predicted amino acid sequence (SEQ ID NO: 6) that is 75% identical and
colinear with the mouse αMSH receptor cDNA sequence (Figures 2A and 2B,
represented as human MSH-R). The predicted molecular weight of the human MSH-R
is 34.7 kD.

EXAMPLE 2C 
Isolation of a Human ACTH (MC-2) Receptor cDNA
For cloning the ACTH receptor (MC-2), a human genomic library was
screened at high stringency (50% formamide, 1 M NaCl, 50 mM Tris-HCl, pH 7.5,
0.1% sodium pyrophosphate, 0.2% sodium dodecyl sulfate, 100 μg/ml salmon sperm
DNA, 10X Denhardt's solution, 42°C), using the human PCR fragments isolated as
described in Example 1 herein and U.S. Patent No. 5,280,112, incorporated by
reference. A genomic clone was isolated that encodes a highly related G-protein coupled
receptor protein (SEQ ID NO: 7 and Figures 3A and 3B). The predicted amino acid
sequence (SEQ ID NO: 8) of this clone is 39% identical and also colinear, excluding
the third intracellular loop and carboxy-terminal tail, with the human MSH receptor
gene product. The predicted molecular weight of this putative ACTH-R is 33.9
kilodaltons (kD). This clone was identified as encoding an MC-2 receptor based on
its high degree of homology to the murine and human MSH receptors, and the pattern
of expression in different tissue types, as described in Example 3 in U.S. Patent
5,280,112.

EXAMPLE 2D

Isolation of a Bovine ACTH (MC-2) Receptor cDNA

A bovine genomic DNA clone encoding the bovine counterpart of the MC-2 
(ACTH) receptor was isolated from a bovine genomic library, essentially as described 
in Example 2C above, and its nucleotide sequence determined (as shown in Figures 
4A and 4B; SEQ ID Nos: 9 & 10).

EXAMPLE 2E
Isolation of a Rat γ-MSH (MC-3) Receptor cDNA
The mouse αMSH receptor cDNA isolated as described in Example 2A and co-owned
U.S. Patent No. 5,532,347 was used to screen a rat hypothalamus cDNA library
at low stringency (30% formamide, 5X SSC, 0.1% sodium pyrophosphate, 0.2% sodium
dodecyl sulfate, 100 μg/ml salmon sperm DNA, and 10% Denhardt's solution) at 42°C
for 18 h. A 1 kb cDNA clone was isolated and sequenced as described in co-owned U.S.
Patent No. 5,532,347, and this clone was used to re-screen the rat hypothalamus cDNA
library at high stringency (same conditions as above except that formamide was present
at 45%). A cDNA clone approximately 2.0 kb in length was isolated and analyzed as
described in co-pending U.S. Application Serial No. 08/044,812, incorporated by
reference; a portion of this cDNA comprising the coding region was sequenced and is
shown in Figures 5A and 5B (SEQ ID Nos: 11 & 12).

EXAMPLE 2F 

Isolation of a Human MC-4 Receptor DNA

For cloning the MC-4 receptor, a human genomic library was screened at
moderate stringency (40% formamide, 1 M NaCl, 50 mM Tris-HCl, pH 7.5, 0.1%
sodium pyrophosphate, 0.2% sodium dodecyl sulfate, 100 μg/ml salmon sperm DNA,
10X Denhardt's solution, 42°C), using rat PCR fragments isolated as described in
Example 1 herein, with the exception that the following primers were used for PCR:

Primer II (sense):

GAGTCGACC(A/G)CCCATGTA(C/T)T(AGT)(C/T)TTCATCTG
(SEQ ID NO: 13)

and

Primer VII (antisense):

CAGAATTCGGAA(A/G)GC(A/G)TA(G/T)ATGA(A/G)GGGGTC
(SEQ ID NO: 14)

A genomic clone was isolated that encodes a highly related G-protein coupled receptor
protein (SEQ ID NO: 15 and Figures 6A and 6B) on a 1.9 kb HindIII fragment. The
predicted amino acid sequence (SEQ ID NO: 16) of this clone has 55-61% sequence
identity with human MC-3 and MC-5 receptors, and 46-47% sequence identity with
the human MC-1 and MC-2 (ACTH) receptors.
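
Percent sequence identity figures such as those quoted above are computed from an alignment. The following Python sketch shows one common convention, under stated assumptions: the two sequences are already aligned to equal length, gaps are written as "-", and identity is counted over columns in which neither sequence has a gap. The text does not state which convention was used, and the two fragments below are invented for illustration and are not taken from the sequence listings.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identical residues over aligned columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap or y == gap:
            continue  # skip gapped columns
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy aligned fragments (hypothetical, not from the SEQ ID listings).
frag_a = "MVNSTHRGMH-SLHLW"
frag_b = "MSNSTQRGMHASLHLW"
print(round(percent_identity(frag_a, frag_b), 1))  # 86.7
```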

EXAMPLE 2G

Isolation of a Mouse MC-5 Receptor DNA
One million clones from a mouse 129SVJ genomic library comprising 5,000,000
clones in the λFixII vector (Stratagene) were screened at low stringency (hybridization
in 40% formamide at 42°C, washing performed in 0.5X SSC at 60°C, as described
above in Example 2E) using radiolabeled probes from the rat MC-3 and MC-4 receptors
(as described in Examples 2E and 2F). Positively-hybridizing clones were isolated and
sequenced, and the sequences obtained were compared to previously-isolated
melanocortin receptor clones. One clone, comprising a previously-unknown sequence,
was determined to encode the MC-5 melanocortin receptor. The nucleotide and amino
acid sequences of this receptor are shown in Figures 7A and 7B (SEQ ID Nos: 17 & 18).

EXAMPLE 3 

Construction of a Recombinant Expression Construct, DNA Transfection 
and Functional Expression of the MCR Gene Products

In order to produce recombinant mammalian cells expressing each of the 
melanocortin receptors of Example 2, cDNA from each receptor was cloned into a 
mammalian expression construct, the resulting recombinant expression construct 
transfected into human 293 cells, and cell lines generated that expressed the 
melanocortin receptor proteins in cellular membranes at the cell surface. 

The mouse αMSH receptor was cloned by excising the entire coding region of
the αMSH-R (MC-1) cDNA insert comprising a 2.1 kb fragment and subcloning this
fragment into the BamHI/XhoI sites of pcDNAI/neo expression vector (Invitrogen, San
Diego, CA). The resulting plasmid was prepared in large-scale through one cycle of
CsCl gradient ultracentrifugation, and 20 μg of the plasmid transfected into each
100 mm dish of 293 cells using the calcium phosphate method (see Chen & Okayama,
1987, Mol. Cell. Biol. 7: 2745-2752). After transfection, cells were cultured in DMEM media
supplemented with 10% calf serum in a 3% CO2 atmosphere at 37°C. Selection was [...]

NDP-MSH > γ-MSH > α-MSH > ACTH(4-10) >>> ORG2766.
Approximate Ki values derived from this experiment are as shown in Table I:

TABLE I

Agonist             Ki (approx)
NDP-MSH             2 x 10⁻⁸
γ-MSH               5 x 10⁻⁸
α-MSH               1 x 10⁻⁷
ACTH(4-10)          8 x 10⁻⁵

cAMP production assays as described above were also used to analyze 
expression of MC3-R in cells transfected with the expression vectors described herein 
as follows. Cells (~5 x 10⁶) were plated in 6-well dishes, washed once with DMEM
containing 1% bovine serum albumin (BSA) and 0.5 mM IBMX (a phosphodiesterase
inhibitor), then incubated for 1 h at 37°C with varying concentrations of the melanotropic
peptides αMSH, γ1MSH, γ2MSH, the MSH peptide analogue Nle⁴-D-Phe⁷-αMSH
(NDP-MSH), ACTH(4-10) and ACTH(1-39). Following hormone treatment, the cells were
washed twice with phosphate buffered saline and intracellular cAMP extracted by lysing
the cells with 1 ml of 60% ethanol. Intracellular cAMP concentrations were determined
using an assay which measures the ability of cAMP to displace [8-³H]cAMP from a
high affinity cAMP binding protein (see Gilman, 1979, Proc. Natl. Acad. Sci. USA 67:
305-312).

The results of these experiments are shown in Figures 11A through 11C. The
abscissa indicates the concentration of each hormone and the ordinate indicates the
percentage of basal intracellular cAMP concentration achieved by each treatment. Points
indicate the mean of duplicate incubations; the standard error did not exceed 15% for
any data point. Figure 11A depicts the results of experiments using peptides found in
vivo; Figure 11B depicts results found with γ-MSH variants; and Figure 11C shows
results of synthetic melanocortin analogues. None of the peptides tested induced any
change in intracellular cAMP in cells containing the vector alone. Cells expressing rat
MC3-R responded strongly to every melanotropic peptide containing the MSH sequence
His-Phe-Arg-Trp, with up to a 60-fold elevation of intracellular cAMP levels. EC50
values ranged from 1-50 nM. The most potent ligand and the one having the lowest
EC50 was found to be γMSH. The order of potency for the naturally occurring
melanocortins was found to be:

γ1-MSH = γ2-MSH > αMSH = ACTH(1-39) > γ3-MSH > des-acetyl-αMSH > ACTH(4-10).
EC50 values for these compounds are shown in Table II:



TABLE II

Agonist             EC50
NDP-MSH             1 x 10⁻⁹
γ1-MSH              3 x 10⁻⁹
γ2-MSH              3 x 10⁻⁹
α-MSH               4 x 10⁻⁹
ACTH(1-39)          4 x 10⁻⁹
γ3-MSH              6 x 10⁻⁹
desacetyl-αMSH      8 x 10⁻⁹
ACTH(4-10)          1 x 10⁻⁷
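
The order of potency stated before Table II can be recovered from the tabulated EC50 values by sorting, since a lower EC50 corresponds to a more potent agonist. The Python sketch below does this and also expresses each potency relative to α-MSH. It assumes the Table II values are molar concentrations, which the table does not state explicitly; the dictionary keys and function names are likewise illustrative.

```python
# EC50 values for rat MC3-R from Table II (assumed to be molar).
EC50_MC3R = {
    "NDP-MSH": 1e-9,
    "gamma1-MSH": 3e-9,
    "gamma2-MSH": 3e-9,
    "alpha-MSH": 4e-9,
    "ACTH(1-39)": 4e-9,
    "gamma3-MSH": 6e-9,
    "desacetyl-alpha-MSH": 8e-9,
    "ACTH(4-10)": 1e-7,
}


def rank_by_potency(ec50_by_agonist):
    """Agonist names ordered from most potent (lowest EC50) to least potent."""
    return sorted(ec50_by_agonist, key=ec50_by_agonist.get)


def relative_potency(ec50_by_agonist, reference="alpha-MSH"):
    """Potency of each agonist relative to the reference (values > 1 are more potent)."""
    ref = ec50_by_agonist[reference]
    return {name: ref / value for name, value in ec50_by_agonist.items()}


print(rank_by_potency(EC50_MC3R))
print(relative_potency(EC50_MC3R)["NDP-MSH"])
```

Ties (for example the two γ-MSH forms) keep their listed order because Python's sort is stable.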



Additionally, a synthetic melanocortin peptide (ORG2766), known to have the 
greatest activity in vivo in stimulation of retention of learned behavior and in stimulation 
of neural regeneration, was unable to stimulate MC3-R-mediated cAMP production, and 
was also inactive as an antagonist. The results strongly indicate that this peptide does 
not bind to MC3-R protein. 

The MC-4 receptor was cloned in a 1.9 kb HindIII genomic DNA fragment after
PCR amplification of a lambda phage clone into pcDNAI/Neo (Invitrogen). This
plasmid was stably introduced into human 293 cells by calcium phosphate
co-precipitation using standard techniques, and plasmid-containing cells selected in G418
containing media. Specificity of receptor-hormone binding was assayed using adenylate
cyclase activity as described above. The MC-4 receptor was found to couple to
adenylate cyclase activity having the following pattern of agonist affinity:

NDP-MSH > des-acetyl-α-MSH ≥ ACTH(1-39) ≥ α-MSH >> γ2-MSH = ACTH(4-10)

whereas the synthetic ACTH(4-9) analogue ORG2766 showed no detectable binding to the
MC-4 receptor. The results of adenylate cyclase activity assays are shown in Figure 12.
EC50 values for each of the tested MC-4 receptor agonists are as shown in Table III:



TABLE III

Agonist             EC50
NDP-MSH             1.1 x 10⁻¹¹ M
desacetyl-αMSH      4.9 x 10⁻¹⁰ M
ACTH(1-39)          6.8 x 10⁻¹⁰ M
α-MSH               1.5 x 10⁻⁹ M
γ2-MSH              > 10⁻⁷ M
ACTH(4-10)          > 10⁻⁷ M



A 1.6 kb ApaI-HindIII fragment comprising the entire coding sequence of the
mouse MC-5 melanocortin receptor disclosed in Example 2G above was cloned into the
pcDNAI/neo expression vector (Invitrogen) after PCR amplification of the lambda phage
clone. This plasmid was stably introduced into human 293 cells by calcium phosphate
co-precipitation using standard techniques, and plasmid-containing cells selected in
G418 containing media. Specificity of receptor-hormone binding was assayed using
adenylate cyclase activity as described above. The MC-5 receptor was found to couple
to adenylate cyclase activity having the following pattern of agonist affinity:

α-MSH > βMSH >> γ-MSH

The results of adenylate cyclase activity assays are shown in Figure 13. EC50 values for
each of the tested MC-5 receptor agonists are: α-MSH = 1.7 x 10⁻⁹ M; and βMSH = 5 x
10⁻⁹ M.



EXAMPLE 4 

Melanocortin Analogue Binding to Mammalian Melanocortin Receptors 
Recombinant cells prepared as described above in Example 3 were used to 
characterize receptor binding of two melanocortin analogues comprising cyclic lactam 
heptapeptides.

The melanocortin receptor analogue SHU9119 has the following chemical
structure:

Ac-Nle⁴-cyclo(Asp⁵, D-Nal(2)⁷, Lys¹⁰)αMSH-(4-10)-amide

The melanocortin receptor analogue MTII has the following chemical
structure:

Ac-Nle⁴-cyclo(Asp⁵, His⁶, D-Phe⁷, Arg⁸, Trp⁹, Lys¹⁰)αMSH-(4-10)-amide

These analogues were prepared as described in Hruby et al. (1995, J. Med. Chem.
38: 3454-3461).

These analogues were tested for melanocortin receptor binding using a
colorimetric assay system developed by some of the instant inventors (Chen et al., 1995,
Analyt. Biochem. 226: 349-354) as follows. A series of concatamers of the synthetic
oligonucleotide:

5'-GAATTCGACGTCACAGTATGACGGCCATGG-3'
(SEQ ID No: 19)

was produced by self-annealing and ligation and a tandem tetramer obtained. This
fragment was cloned upstream of a fragment of the human vasoactive intestinal peptide
promoter (-93 to +152; SEQ ID No.: 13; see Fink et al., 1988, Proc. Natl. Acad. Sci. USA
85: 6662-6666). This promoter was then cloned upstream of the β-galactosidase gene from
E. coli. The resulting plasmid construct is shown in Figure 14.
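
The tandem tetramer can be modelled as four head-to-tail copies of the 30-nucleotide oligonucleotide. The Python sketch below builds that string and counts occurrences of a CRE half-site on both strands. Two points are assumptions made for illustration: that the ligated units are all in the same orientation, and that TGACG (half of the palindromic CRE consensus TGACGTCA) is the relevant motif, since the text does not say which bases constitute the responsive element. Function names are hypothetical.

```python
# Synthetic oligonucleotide of SEQ ID No: 19, as given in the text.
CRE_OLIGO = "GAATTCGACGTCACAGTATGACGGCCATGG"

# Assumed half-site motif; the text does not specify the CRE bases.
HALF_SITE = "TGACG"


def reverse_complement(seq):
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def tandem_repeat(unit, copies):
    """Head-to-tail concatamer of `copies` units (assumed orientation)."""
    return unit * copies


def count_half_sites(seq, motif=HALF_SITE):
    """Count the motif on both strands: forward motif plus its reverse complement."""
    return seq.count(motif) + seq.count(reverse_complement(motif))


tetramer = tandem_repeat(CRE_OLIGO, 4)
print(len(tetramer))               # 120
print(count_half_sites(tetramer))  # 8
```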

Transient transfection of the pCRE/β-gal plasmid described above was
performed as follows. Cells grown to between 40-60% confluency (corresponding to
about 1.5 million cells/6 cm tissue culture plate) were incubated with Opti-MEM
(GIBCO-BRL, Long Island, NY) and then contacted with a pCRE/β-gal-lipofectin
complex which was prepared as follows. 3 μg plasmid DNA and 20 μL lipofectin reagent
(GIBCO) were each diluted into 0.5 mL Opti-MEM media and then mixed together. This
mixture was incubated at room temperature for 15-20 min, and then the mixture (1 mL)
added to each 6 cm plate. Transfected plates were incubated at 37°C for 5-24 h, after
which the plates were washed and incubated with DMEM media (GIBCO) and the cells
split equally into a 96-well culture plate.

To assay melanocortin receptor analogue binding, human 293 cells expressing
each of the melanocortin receptors MC-1, MC-3, MC-4 and MC-5, and mouse Y1 cells
expressing the MC-2 receptor, were transiently transfected with pCRE/β-gal as described
above and assayed as follows. Two days after transfection, cells were stimulated with
hormones specific for each receptor or hormone analogue by incubation for 6 h at 37°C
with a mixture comprising hormone or analogue (10⁻¹² to 10^[...] M), 0.1 mg/mL bovine
serum albumin [...] DMEM. The effect of hormone or
analogue binding was determined by β-galactosidase assay according to the method of
Felgner et al. (1994, J. Biol. Chem. 269: 2550-2561). Briefly, media was aspirated from
culture wells and 50 μL lysis buffer (0.25 M Tris-HCl, pH 8/0.1% Triton-X100) added
to each well. Cell lysis was enhanced by one round of freezing and thawing the cell/lysis
buffer mixture. 10 μL aliquots were sampled from each well for protein determination
using a commercially-available assay (BioRad, Hercules, CA). The remaining 40 μL
from each well was diluted with 40 μL phosphate buffered saline/0.5% BSA and 150 μL
substrate buffer (60 mM sodium phosphate/1 mM MgCl2/10 mM KCl/5 mM
β-mercaptoethanol/2 mg/mL o-nitrophenyl-β-D-galactopyranoside) added. Plates were
incubated at 37°C for 1 h and then absorbance at 405 nm determined using a 96-well plate
reader (Molecular Devices, Sunnyvale, CA). A series of two-fold dilutions from 20 ng
of purified β-galactosidase protein (Sigma Chemical Co, St. Louis, MO) were assayed
in parallel in each experiment to enable conversion of OD405 to known quantity of
β-galactosidase protein.
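
Converting an OD405 reading to a quantity of β-galactosidase by means of the parallel standard series can be sketched as follows. This Python example is a minimal illustration under assumptions: linear interpolation between adjacent standards is used (the text does not specify the fitting method), the absorbance readings and protein amount are invented numbers, and the final division by protein content is one plausible use of the protein determination, not a stated step. Function names are hypothetical.

```python
def standard_dilutions(top_ng=20.0, steps=6):
    """Two-fold dilution series starting at top_ng (ng of enzyme per well)."""
    return [top_ng / 2 ** i for i in range(steps)]


def od_to_ng(od, standards):
    """Linearly interpolate enzyme mass from (ng, OD405) standard pairs."""
    pts = sorted(standards, key=lambda p: p[1])  # ascending OD405
    if not pts[0][1] <= od <= pts[-1][1]:
        raise ValueError("OD405 outside the range of the standard curve")
    for (ng_lo, od_lo), (ng_hi, od_hi) in zip(pts, pts[1:]):
        if od_lo <= od <= od_hi:
            if od_hi == od_lo:
                return ng_lo
            frac = (od - od_lo) / (od_hi - od_lo)
            return ng_lo + frac * (ng_hi - ng_lo)


# Hypothetical plate-reader values for the standards (not from the disclosure).
ng_series = standard_dilutions()  # 20, 10, 5, 2.5, 1.25, 0.625 ng
od_series = [1.60, 0.80, 0.40, 0.20, 0.10, 0.05]
curve = list(zip(ng_series, od_series))

sample_od = 0.60          # hypothetical well reading
sample_protein_ug = 12.0  # hypothetical protein content of the well
ng = od_to_ng(sample_od, curve)
print(ng, ng / sample_protein_ug)  # about 7.5 ng, about 0.625 ng per ug protein
```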

The results of these experiments are shown in Figure 15. This Figure shows the 
results of the P-galactosidase assay described above using cells expressing each of the 
MC-1, MC-3, MC-4 or MC-5 receptors and contacted with αMSH or a variety of αMSH
analogues, including SHU9119. These results showed that SHU9119 had relatively
weak agonist activity for both the human MC-3 and MC-4 receptors. 

These results demonstrated the development of a colorimetric assay for cAMP 
accumulation as the result of melanocortin receptor binding to agonists and antagonists. 

The action of MTII, SHU9119, and the endogenous mouse agouti peptide as
agonists or antagonists of rodent MC receptors was first determined by examining their
ability to elevate intracellular cAMP in 293 cell lines expressing the rat MC3-R or
mouse MC4-R (expressed as IC50 values representing ligand concentration required for
half-maximal inhibition of binding of (¹²⁵I)-(Nle⁴, D-Phe⁷)α-MSH tracer).
Agonist/antagonist activity was also shown by demonstrating inhibition of cAMP
elevation by the potent α-MSH analogue [Nle⁴, D-Phe⁷]α-MSH, using either a
cAMP-responsive β-galactosidase reporter construct as described above or an adenylate
cyclase assay as described in Example 3 (wherein EC50 values represent ligand
concentration required for half-maximal activation of a cAMP-responsive
β-galactosidase reporter). In competition binding experiments, non-specific binding was
determined as the amount of radioactivity bound in the presence of 5 x 10⁻⁶ M unlabeled
[Nle⁴, D-Phe⁷]α-MSH, and was typically 3-5% of total counts bound.

In these experiments, murine agouti peptide was produced using a baculovirus
system as described by Lu et al. (1994, Nature 371: 799-802), with the modification that
the agouti peptide was purified from baculovirus supernatants by 0.6 M NaCl step elution
from an EconoS cation exchange column (BioRad). Agouti peptide used in these assays
was approximately 60% pure.

Competition binding assays were performed to determine whether SHU9119 had
antagonist activity towards αMSH binding to either the MC-3 or MC-4 receptors. These
assays were performed as follows. Human 293 cells (100,000 cells/well in 24-well
plates) expressing either the MC-3 or MC-4 receptors prepared as described above were
incubated with a solution of 1mg/mL BSA in PBS containing 100,000cpm (3.1x10^-10 M)
[125I](Nle4, D-Phe7)αMSH and varying concentrations of αMSH, (Nle4, D-Phe7)αMSH
or SHU9119. Cells were incubated for 30min at 37 °C, washed twice with PBS-BSA,
lysed with 0.5mL 0.5N NaOH, and counted using a γ-counter to quantitate the amount
of bound [125I](Nle4, D-Phe7)αMSH. Control experiments showed non-specific binding
to occur at about 3-5% levels, and this was taken into account when analyzing the
experimental results.
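
By way of illustration only, the correction for non-specific binding and the estimation of a half-maximal inhibitory concentration from such counts can be sketched as follows. The function name, the log-linear interpolation (used here in place of a full curve fit) and all readings are assumptions made for the example and are not data or methods taken from the experiments described above.

```python
import math


def ic50_by_interpolation(concs_molar, bound_cpm, uninhibited_cpm, nonspecific_cpm):
    """Estimate the IC50 of a competitor from gamma-counter readings.

    concs_molar     -- competitor concentrations (M), in ascending order
    bound_cpm       -- total counts bound at each competitor concentration
    uninhibited_cpm -- counts bound with tracer alone (no competitor)
    nonspecific_cpm -- counts bound in the presence of excess unlabeled ligand
    """
    window = uninhibited_cpm - nonspecific_cpm
    # Fraction of specific binding remaining at each competitor concentration.
    frac = [(c - nonspecific_cpm) / window for c in bound_cpm]
    for i in range(1, len(frac)):
        hi, lo = frac[i - 1], frac[i]
        if hi >= 0.5 >= lo and hi != lo:
            x0 = math.log10(concs_molar[i - 1])
            x1 = math.log10(concs_molar[i])
            # Linear interpolation on a log10 concentration axis.
            return 10 ** (x0 + (hi - 0.5) * (x1 - x0) / (hi - lo))
    return None  # 50% displacement is not bracketed by the data


# Hypothetical readings; non-specific binding is about 4% of total counts.
concs = [1e-11, 1e-10, 1e-9, 1e-8, 1e-7]
counts = [10400, 9400, 5400, 1400, 500]
ic50 = ic50_by_interpolation(concs, counts, uninhibited_cpm=10400, nonspecific_cpm=400)
print(f"estimated IC50 = {ic50:.2e} M")
```

A non-linear fit of a one-site competition model would normally be preferred where enough points are available; the interpolation is shown only because it needs no external library.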

The SHU9119 analogue was found to be a potent antagonist of both the human
MC-3 and MC-4 receptors, as shown in Figure 16. These assays showed pA2 values of
8.3 and 9.3 for the human MC-3 and MC-4 receptors, respectively, as determined using
the method of Schild (1947, Brit. J. Pharmacol. 2: 189-206). In contrast, no significant
alteration in IC50 values was detected in binding experiments using this analogue with
either the MC-3 or MC-4 receptors (Figure 17).
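
For orientation, a Schild analysis of the kind cited above can be sketched as follows. The dose ratio (DR) is taken to be the agonist EC50 in the presence of antagonist divided by the EC50 in its absence; log10(DR - 1) is regressed on log10 of the antagonist concentration, and pA2 is the negative of the x-intercept. The function name and the input values are hypothetical and are generated from an ideal competitive model, not from the data of Figure 16.

```python
import math


def schild_pa2(antagonist_molar, dose_ratios):
    """Return (slope, pA2) from a Schild plot.

    Regresses log10(DR - 1) on log10[B] by ordinary least squares;
    pA2 is minus the x-intercept of the fitted line.
    """
    xs = [math.log10(b) for b in antagonist_molar]
    ys = [math.log10(dr - 1.0) for dr in dose_ratios]
    n = len(xs)
    x_mean, y_mean = sum(xs) / n, sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    # x-intercept = -intercept / slope, so pA2 = intercept / slope.
    return slope, intercept / slope


# Hypothetical dose ratios for an ideal competitive antagonist, K_B = 10**-9.3 M.
k_b = 10 ** -9.3
concs = [1e-9, 1e-8, 1e-7]
ratios = [1.0 + b / k_b for b in concs]
slope, pa2 = schild_pa2(concs, ratios)
print(f"slope = {slope:.2f}, pA2 = {pa2:.2f}")
```

A slope close to unity is the usual indication that the antagonism is competitive, in which case pA2 approximates the negative logarithm of the antagonist dissociation constant.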

The activity of the MTII analogue was also assayed for melanocortin receptor 
agonist activity. These results are shown in Figures 18A and 18B, and confirmed that 
the MTII analogue is a specific agonist of the MC-3 and MC-4 receptors. 

Specific competition of [Nle4, D-Phe7]α-MSH binding to rat MC-3 receptor by
agouti peptide was observed, although accurate IC50 values could not be determined
because the peptide preparation was not homogeneous (results not shown). Specific
competition of α-MSH activation of human MC4-R by agouti was previously disclosed
(Lu et al., 1994, Nature 371: 799-802).

EXAMPLE 5

Feeding Behavior Effect of Melanocortin Analogue Binding in Brain

The results shown in Example 4 above suggested a role in the regulation of
feeding behavior in mammalian brain for MC receptor agonists and antagonists, in view
of the antagonist properties of the agouti peptide at the MC-3 and MC-4 receptors. The
agouti peptide was known to cause obesity when expressed ectopically in the mouse, and
has been found to be a high affinity antagonist of the melanocyte stimulating hormone
receptor (MC1-R) and of the hypothalamic MC-4 receptor (see Lu et al., ibid.). The
former activity explained the inhibitory effect of the agouti peptide on eumelanin
pigment synthesis. Similarly, it was hypothesized by the inventors that agouti causes
obesity in mice by antagonizing hypothalamic MC-4 receptors. The cyclic melanocortin
analogue, SHU9119, having been shown herein and elsewhere (Hruby et al.) to be a
specific, high affinity antagonist of the central MC-3 and MC-4 receptors, was tested for
the effect of direct administration to mouse brain on feeding behavior in the animals.
Intracerebroventricular (ICV) administration of SHU9119 was performed to avoid any
complications caused by inhibition of peptide traverse of the blood-brain barrier.

Briefly, male C57Bl/6J mice (18-29g) were maintained on a normal 12hr/12hr
light/dark cycle with food (Purina mouse chow) and water ad libitum. Animals were
housed individually for 24 hrs, distributed into experimental and control groups,
avoiding any bias as a function of prior weight, then injected with vehicle or vehicle plus
drug just prior to the onset of a 12hr light or dark cycle. Fasted animals were deprived
of food from 18:00 to 10:30 hrs to stimulate feeding during the daytime experimental
period. Animals were lightly anesthetized with halothane, and administered into one
lateral ventricle 2µl of a solution of artificial cerebrospinal fluid alone (acsf,
comprising 130mM NaCl, 27mM NaHCO3, 1.2mM Na2HPO4, 0.3mM NaH2PO4,
0.5mM Na2SO4, 1.0mM CaCl2, 1.0mM MgCl2, and 2.5mM KCl), or 6nmol SHU9119
in acsf. Freehand injections were performed as described by Laursen and Belknap
(1986, J. Pharmacol. Methods 16: 355-357) with some modifications. A 10µl luer tip
syringe (Hamilton 701LT) was fitted with a 0.5 inch 27 gauge needle. Stiff tygon tubing
was slipped over the needle to expose 3mm of the needle tip. The syringe was held at
a 45° angle from the front of the skull with the bevel facing up. The coronal suture was
found by lightly rubbing the needle over the skull. Maintaining the 45° angle, the needle
was then inserted 1-2mm lateral to the midline, using only mild pressure to insert and
remove the needle. The compounds indicated in a 2µl volume of acsf were administered
slowly over approximately 15 seconds, and the needle removed after 35 seconds.
Animals were allowed to recover from anesthesia and placed into a cage containing a
premeasured quantity of food pellets in a spill-free cup. Moribund animals were not
included in the study.

Stimulation of feeding by intracerebroventricular administration of the
melanocortin antagonist SHU9119 is shown in Figures 19A through 19C. Curves show
cumulative food intake as a function of time following administration of the substances
shown. Figure 19A shows stimulation of feeding by administration of SHU9119 just
prior to lights off (19:00 hrs) to C57Bl/6J mice fed ad libitum. Figure 19B, in contrast,
shows no effect of morning (10:00 hrs) SHU9119 administration in C57Bl/6J mice fed
ad libitum. Figure 19C illustrates stimulation of daytime feeding by SHU9119
administration in fasted C57Bl/6J mice. In deriving the data points comprising these
Figures, food remaining was briefly removed and weighed at the time intervals
indicated. Data points indicate the mean and bars indicate standard error. Significance
of the effect over time was determined by ANOVA with repeated measures.
Significance of drug effects at individual time points was determined by two-way
ANOVA, and is indicated in each Figure (***=P<0.001, **=P<0.01, *=P<0.05).

These results demonstrated that ICV administration of SHU9119 into one lateral
ventricle of the C57Bl/6J mouse just prior to lights out led to a mean 60% increase in
food intake over 12 hrs (Figure 19A; P<0.005). In contrast, daytime food intake in
animals fed ad libitum was not stimulated by administration of SHU9119 (Figure 19B).
SHU9119 treatment did, however, significantly stimulate daytime food intake in animals
fasted for 16 hrs prior to the experiment (Figure 19C; P<0.001). Stimulation of feeding
was evident at approximately two hours and continued for 12 hrs, to
produce a mean 34% increase in food intake relative to vehicle-injected controls.
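
As a minimal sketch of how a mean, a standard error and a percentage change relative to vehicle-injected controls can be derived from per-animal food intake, the following may be considered. The function names and the intake values are hypothetical and were chosen only so that the arithmetic is easy to follow; they are not the measurements underlying Figures 19A through 19C.

```python
import math
import statistics


def mean_sem(values):
    """Return (mean, standard error of the mean) for one group."""
    mean = statistics.mean(values)
    sem = statistics.stdev(values) / math.sqrt(len(values))
    return mean, sem


def percent_change(treated, control):
    """Percentage change of the treated group mean relative to the control mean."""
    control_mean = statistics.mean(control)
    return 100.0 * (statistics.mean(treated) - control_mean) / control_mean


# Hypothetical cumulative 12 hr food intake (g) for three animals per group.
vehicle_g = [2.2, 2.4, 2.6]
treated_g = [3.6, 3.84, 4.08]

vehicle_mean, vehicle_sem = mean_sem(vehicle_g)
change = percent_change(treated_g, vehicle_g)
print(f"vehicle: {vehicle_mean:.2f} +/- {vehicle_sem:.2f} g; change: {change:.0f}%")
```

Significance testing by ANOVA, as used for the Figures, is not reproduced in this sketch.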

These results supported the hypothesis that agouti and/or SHU9119 stimulate
feeding by antagonizing MC receptors in the central nervous system. To further test this
hypothesis, a series of experiments were performed wherein MC receptor agonists were
administered to animals primed by fasting to eat, to determine whether feeding in such
animals could be inhibited by the MC receptor agonists. Animals were induced to feed
by food deprivation for 16h prior to ICV administration of the non-specific melanocortin
agonist MTII. In these experiments, ICV injections in male C57Bl/6J mice (20-30g)
and the measurement of food intake were performed as described above.

Results of these experiments are shown in Figures 20A through 20C. In
comparison to vehicle-injected animals, MTII was found to produce a potent inhibition
of feeding within one hour after administration (Figure 20A) in a dose-responsive
manner. Food intake was significantly inhibited for up to four hours following
administration (P<0.001) at the highest dose administered (3nmol), and decreased food
intake continued for the next four hours with normal rates of food intake resuming at
about 8 hours after treatment. This dose-responsive inhibition of feeding had an IC50 at
the two hour time point of approximately 0.5nmol (Figure 20B). However, inhibition
of feeding with 3nmol MTII was completely blocked by co-administration of 6nmol
SHU9119 (Figure 20C; P<0.001), demonstrating that the effect results specifically from
agonist binding to the MC-4 and/or MC-3 receptor.

Locomotor assays were performed to determine whether the effects on feeding 
behavior observed in these mice were secondary to generalized behavioral effects caused 
by administration of these melanocortin analogues. The effects of MTII on locomotor 
activity were tested by placing vehicle or MTII-treated mice in sound and light-proof 
cages containing multiple light beam detectors. These assays were performed by first 
injecting 3nmol MTII or acsf as described above. At three hours (2:45-3:25) post- 
injection, 12 mice were placed into 12 separate boxes containing multiple infrared light 
sources and photodetectors. The boxes were contained within separate ventilated light 
and sound attenuating chambers (Coulbourn model E10-20). Disruption of the infrared
beams, with a 10msec resolution, was tallied independently for each one minute time 
period in each cage. The results of these assays are shown in Figure 20D. Data points
indicate the mean total activity (# of light breaks) for 6 animals in each experimental
group. Four way ANOVA indicated
an absence of a significant difference among the two groups.
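
The tallying of beam disruptions into one minute periods, and the averaging of those tallies across the animals of a group, can be sketched as below. This is an illustrative reconstruction under stated assumptions: beam breaks are assumed to be recorded as timestamps in seconds from the start of the session, and the function names and timestamps are hypothetical rather than features of the commercial apparatus.

```python
def tally_per_minute(break_times_s, n_minutes):
    """Count beam breaks falling within each consecutive one minute period."""
    counts = [0] * n_minutes
    for t in break_times_s:
        minute = int(t // 60)
        if 0 <= minute < n_minutes:
            counts[minute] += 1
    return counts


def mean_activity(per_animal_counts):
    """Mean beam breaks per minute across animals (lists of equal length)."""
    n = len(per_animal_counts)
    return [sum(column) / n for column in zip(*per_animal_counts)]


# Hypothetical timestamps (s) for one animal over a three minute window.
example = tally_per_minute([0.5, 10.0, 59.99, 60.0, 61.2, 179.0], n_minutes=3)
print(example)
```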

Inhibition of feeding by MTII could not be explained by any apparent behavioral 
abnormalities, or any effect on arousal or locomotor activity. MTII-treated animals 
appeared alert and exhibited no unusual behavior relative to controls. At approximately 
three hours after ICV administration, MTII-treated animals exhibited locomotor activity 
that was indistinguishable from vehicle-treated animals (Figure 20D). The higher initial
activity, indicative of exploratory behavior, and continued locomotion over a 15 min 
period was indistinguishable between the two groups, indicating that the inhibition of 
feeding was not due to decreased locomotion or decreased arousal. 

The administration of MTII also inhibited food intake in three other models of
hyperphagia: the C57Bl/6J-Lep^ob mouse, a neuropeptide Y (NPY)-injected C57Bl/6J
mouse and a C57Bl/6J-A^y mouse. Figure 21A shows inhibition of feeding by
intracerebroventricular administration of MTII in A^y mice (females, 19-28gms). Figure
21B shows inhibition of feeding by intracerebroventricular administration of MTII in
C57Bl/6J mice (females, 21-25gm) stimulated to feed by co-administration of NPY.
Figure 21C shows inhibition of feeding by intracerebroventricular administration of
MTII in ob/ob mice (females, 48-69 gms). Figure 21D shows inhibition of feeding in
ob/ob mice by intraperitoneal administration of MTII (females, 40-45 gms). ICV
injections and measurement of food intake were performed as described above, with the
exception of NPY treated animals, which were not fasted prior to experimentation.
Animals treated intraperitoneally received 100µl of a 1mM solution of MTII in saline,
and vehicle injections consisted of the same amount of saline alone. Significance
indicated for individual time points, determined as described above, was for 3nmol MTII
vs. acsf (Figure 21A), 1.18 nmol NPY vs. 1.18 nmol NPY + 3 nmol MTII (Figure 21B),
3nmol MTII vs. acsf (Figure 21C), and 100 nmol MTII vs. saline (Figure 21D).
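
The relationship between injected volume, solution concentration and administered amount follows from the identity that 1 µl of a 1 mM solution contains 1 nmol of solute. A short sketch, with hypothetical function names, is given below to make the arithmetic explicit for the intraperitoneal and ICV doses recited above.

```python
def amount_nmol(volume_ul, conc_mM):
    """Amount delivered: 1 µl of a 1 mM solution contains 1 nmol."""
    return volume_ul * conc_mM


def required_conc_mM(dose_nmol, volume_ul):
    """Solution concentration needed to deliver dose_nmol in volume_ul."""
    return dose_nmol / volume_ul


# Intraperitoneal dose: 100 µl of a 1 mM solution.
print(amount_nmol(100, 1.0), "nmol")
# ICV doses delivered in a 2 µl injection volume.
print(required_conc_mM(6, 2), "mM for 6 nmol;", required_conc_mM(3, 2), "mM for 3 nmol")
```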

The hyperphagia in these models can be clearly seen by comparing the 12 hr food
intake following a fast in vehicle-injected C57Bl/6J (2.4g, Figure 19A), C57Bl/6J-A^y
(3.7g, Figure 21A) and C57Bl/6J-Lep^ob (3.7g, Figure 21C) animals. As expected, MTII
treatment inhibited food intake following a 16 hr fast in the C57Bl/6J-A^y mouse (Figure
21A; P<0.05). Interestingly, while food intake for the first four hours is significantly
inhibited relative to vehicle, it is also significantly less inhibited in the
C57Bl/6J-A^y animal than in the C57Bl/6J animal given the same 3nmol dose (compare
Figure 20A versus Figure 21A, 1-4 hrs; P<0.001). The decreased effectiveness of the
agonist in the presence of the A^y allele is consistent with the proposal that this allele
results in chronic expression of agouti peptide melanocortin antagonist in the brain.

MTII, upon co-administration, also significantly inhibited the profound
stimulation of feeding induced by NPY, measured over a three hr period (Figure 21B;
P<0.005). Co-administration of an approximately two-fold molar excess of MTII
produced a 74% inhibition of NPY-stimulated food intake at the three hour time point.

Finally, MTII also inhibited hyperphagia due to absence of leptin in the
C57Bl/6J-Lep^ob mouse (Figure 21C; P<0.001). MTII potently blocked feeding (Figure
20A) in these animals, in contrast to the less potent inhibition described above for the
C57Bl/6J-A^y mouse.

The C57Bl/6J-Lep^ob animal was also used to test the ability of MTII to regulate
feeding when administered peripherally. Moderate doses (100nmol) of MTII inhibited
feeding in the C57Bl/6J-Lep^ob mouse (P<0.001) while low doses (10nmol) did not (data
not shown). The kinetics were similar to those seen with ICV administration, with a
potent inhibition of feeding for the first four hours. The 100-fold higher dose required
peripherally, as well as the similar kinetics, suggest a primarily central nervous
system-based mechanism of action of MTII.

These data show that melanocortinergic neurons exert a tonic inhibition of
feeding behavior, and that disruption of this signal leads to hyperphagia. With regard
to the recently-discovered leptin hormone made by adipocytes, which is generally
expressed at elevated levels in obese humans and rodents (such as the C57Bl/6J-Lep^ob
animal), the regulatory defect is understood to be an incapacity to respond properly to
the leptin hormone signal. The instant results indicate that the melanocortins act
independently, and physiologically "downstream," from the leptin hormone/receptor
interaction, because it has been shown herein that melanocortin receptor agonists can
potently inhibit feeding in the C57Bl/6J-Lep^ob animal.

These results suggest that MC receptor agonists and antagonists can affect
mammalian feeding behavior, and provide a means for determining candidate
compounds for the development of effective pharmacological products directed towards
alleviating such human ailments as obesity, anorexia and cachexia.

EXAMPLE 6 

Use of MC Receptor-Expressing Recombinant Cells for Screening Compounds 
that Affect Feeding Behavior in Mammals 

The results obtained in Example 5 indicated that cells expressing a variety of
mammalian melanocortin receptors are useful for characterizing compounds as a first
step towards developing MC receptor agonists and antagonists for controlling feeding
behavior in mammals, particularly obesity and overweight disorders in general, as well
as anorexia, cachexia and other failure-to-thrive disorders.

A panel of mammalian melanocortin receptor-expressing recombinant cells is
provided as described above in Example 3, wherein each member of the panel comprises 
appropriate mammalian cells, such as human 293 cells, comprising a recombinant 
expression construct encoding the MC-1, MC-2 (ACTH), MC-3, MC-4 or MC-5 
receptor, the panel constructed to comprise cells functionally expressing each of these 
MC receptor proteins. 

The panel is used as follows. Receptor agonist activity is assayed by transient
or stable expression of a protein which produces a metabolite reporter molecule in
response to receptor binding by any of the MC receptor proteins. An example of such
a reporter system is the recombinant expression construct described in Example 4,
wherein cAMP responsive elements (CREs) are operatively linked to a
bacterially-derived β-galactosidase (β-gal) gene. In the event of receptor binding, cAMP is
produced in the mammalian cell, and the CRE induces β-gal expression. When
co-incubated with a colorless substrate for β-gal, receptor binding results in conversion of
the colorless substrate to a blue-colored product, which can be easily scored visually or
spectrophotometrically. Alternative reporter genes, such as luciferase, can also be used
as reporter systems, provided that expression of the reporter molecule-producing protein
is functionally linked to receptor binding of a test compound. Alternatively, cAMP
production resulting from MC receptor binding can also be measured directly.

Assay panels are arranged so that agonist activity can be identified, quantitated
and correlated with expression of each MC receptor. Automated assays using such
panels are also envisioned, whereby a reporter metabolite is detected qualitatively and
quantitatively in an array (such as a 96-well tissue culture plate) and the
data collected and assembled into a computer database or other analytical program.
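
One way such an array might be organized for automated collection is sketched below. The layout (one plate row per receptor, one column per test concentration), the function names and the readings are assumptions made for illustration; the disclosure does not prescribe any particular plate map or software.

```python
import string


def build_layout(receptors, concentrations):
    """Map well IDs such as 'A1' to (receptor, concentration) on an 8 x 12 plate."""
    if len(receptors) > 8 or len(concentrations) > 12:
        raise ValueError("layout exceeds an 8 x 12 plate")
    layout = {}
    for r, receptor in enumerate(receptors):
        for c, conc in enumerate(concentrations):
            layout[f"{string.ascii_uppercase[r]}{c + 1}"] = (receptor, conc)
    return layout


def group_by_receptor(layout, readings):
    """Collect (concentration, reading) pairs per receptor, sorted by concentration."""
    grouped = {}
    for well, value in readings.items():
        receptor, conc = layout[well]
        grouped.setdefault(receptor, []).append((conc, value))
    return {name: sorted(pairs) for name, pairs in grouped.items()}


RECEPTORS = ["MC-1", "MC-2", "MC-3", "MC-4", "MC-5"]
CONCS = [10.0 ** e for e in range(-12, 0)]  # twelve hypothetical concentrations (M)
layout = build_layout(RECEPTORS, CONCS)

# Hypothetical absorbance readings for two wells of the MC-3 row.
print(group_by_receptor(layout, {"C2": 0.8, "C1": 0.1}))
```

Grouping readings by receptor in this way yields one dose-response series per receptor, from which a pattern of activity across the panel can be compared.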

Antagonist activity is detected by a modification of the above assay. In this
assay, the inhibition of cAMP production by a standardized amount of a known receptor
agonist, specific for each receptor, is assayed in the presence of a putative antagonist
compound. Production of metabolite reporter molecules and their qualitative and
quantitative detection are achieved as described above, and the specificity and potency of
each antagonist compound characterized with regard to the degree of inhibition achieved
for each receptor.
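
The degree of inhibition referred to above can be expressed, for example, as the percentage reduction of the agonist-induced reporter signal after subtraction of the basal signal. The following sketch assumes three readings per receptor (basal, agonist alone, and agonist plus putative antagonist); the function name and the absorbance values are hypothetical.

```python
import math


def percent_inhibition(basal, agonist_alone, agonist_plus_test):
    """Percentage by which a test compound reduces the agonist-induced signal."""
    window = agonist_alone - basal
    if window <= 0:
        raise ValueError("agonist control must exceed the basal signal")
    return 100.0 * (1.0 - (agonist_plus_test - basal) / window)


# Hypothetical absorbance readings from a colorimetric reporter assay.
inhibition = percent_inhibition(basal=0.1, agonist_alone=1.1, agonist_plus_test=0.35)
print(f"{inhibition:.0f}% inhibition")
```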

In view of the instant disclosure, MC-3/MC-4 receptor agonists are expected
to be useful to inhibit food intake in a hungry animal, and MC-3/MC-4 receptor antagonists
are expected to be useful to increase food intake in an animal. Alternative patterns of
feeding behavior associated with different patterns of MC receptor agonist/antagonist
activity can be determined using this assay.

Compounds having agonist or antagonist activity with the MC-3 or MC-4
receptors detected using this assay are further screened in vivo to determine whether the
observed receptor binding activity results in modification of feeding behavior when
administered to an animal. In these assays, the MC receptor binding compounds
detected using the assay are administered intracerebroventricularly as described above in
Example 5 to animals after an overnight fast, to waking animals, or to animals that are
not otherwise primed to be hungry. Feeding and locomotor activity are monitored in these
animals, and compounds affecting eating behavior (either by inhibiting feeding in
otherwise hungry animals or stimulating feeding in otherwise sated animals) are selected
for further development.

In addition, systemic administration of compounds found to be active by ICV 
administration assays is used to screen such compounds for the ability to cross the blood- 
brain barrier. Such compounds are also useful as templates for modifications aimed at 
increasing the availability of these compounds in the brain after systemic administration, 
for increasing bioactivity, or both. 

It should be understood that the foregoing disclosure emphasizes certain specific 
embodiments of the invention and that all modifications or alternatives equivalent 
thereto are within the spirit and scope of the invention as set forth in the appended 
claims.

SEQUENCE LISTING



(1) GENERAL INFORMATION:

    (i) APPLICANT:
        (A) NAME: Oregon Health Sciences University
        (B) STREET: 3181 S.W. Sam Jackson Park Road
        (C) CITY: Portland
        (D) STATE: Oregon
        (E) COUNTRY: USA
        (F) POSTAL CODE (ZIP): 97201
        (G) TELEPHONE: 503-494-8200
        (H) TELEFAX: 503-494-4729

    (ii) TITLE OF INVENTION: Methods and Reagents for Discovering and
        Using Mammalian Melanocortin Receptor Agonists and Antagonists
        To Modulate Feeding Behavior in Animals

    (iii) NUMBER OF SEQUENCES: 19

    (iv) COMPUTER READABLE FORM:
        (A) MEDIUM TYPE: Floppy disk
        (B) COMPUTER: IBM PC compatible
        (C) OPERATING SYSTEM: PC-DOS/MS-DOS
        (D) SOFTWARE: PatentIn Release #1.0, Version #1.25

    (v) CURRENT APPLICATION DATA:
        (A) APPLICATION NUMBER:


(2) INFORMATION FOR SEQ ID NO: 1:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 35 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: DNA (genomic)

    (ix) FEATURE:
        (A) NAME/KEY: misc_feature
        (B) LOCATION: 1..35
        (D) OTHER INFORMATION: /function= "Degenerate
            oligonucleotide primer (sense)"
            /note= "The residues at positions 23 and 24 are
            inosine"

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

GAGTCGACCT GTGYGYSATY RCNNTKGACM GSTAC 35



(2) INFORMATION FOR SEQ ID NO: 2:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 32 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: DNA (genomic)

    (ix) FEATURE:
        (A) NAME/KEY: misc_feature
        (B) LOCATION: 1..32
        (D) OTHER INFORMATION: /function= "Degenerate
            oligonucleotide primer (antisense)"
            /note= "The residue at position 18 is inosine"

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

CAGAATTCAG WAGGGCANCC AGCAGASRYG AA 32
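
As an aid to reading the two degenerate primers, the following sketch counts the number of distinct oligonucleotides each primer represents under the standard IUPAC ambiguity codes and reports the positions written as N. It assumes that each N denotes inosine, as stated in the feature notes, and that an inosine position is counted once rather than as four alternatives; the function names are hypothetical.

```python
# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def degeneracy(primer, universal="N"):
    """Number of distinct sequences, counting `universal` positions once."""
    count = 1
    for base in primer:
        if base not in universal:
            count *= len(IUPAC[base])
    return count


def positions(primer, symbol="N"):
    """1-based positions at which `symbol` occurs."""
    return [i + 1 for i, base in enumerate(primer) if base == symbol]


SENSE = "GAGTCGACCTGTGYGYSATYRCNNTKGACMGSTAC"   # SEQ ID NO: 1
ANTISENSE = "CAGAATTCAGWAGGGCANCCAGCAGASRYGAA"  # SEQ ID NO: 2

for name, primer in (("sense", SENSE), ("antisense", ANTISENSE)):
    print(name, len(primer), positions(primer), degeneracy(primer))
```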



(2) INFORMATION FOR SEQ ID NO: 3:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 1260 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: cDNA to mRNA

    (ix) FEATURE:
        (A) NAME/KEY: 5'UTR
        (B) LOCATION: 1..14

    (ix) FEATURE:
        (A) NAME/KEY: CDS
        (B) LOCATION: 15..959

    (ix) FEATURE:
        (A) NAME/KEY: 3'UTR
        (B) LOCATION: 960..1260






WO 98/10068 



PCT/US97/15565 



    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

TTCCTGACAA GACT ATG TCC ACT CAG GAG CCC CAG AAG AGT CTT CTG GGT 50
Met Ser Thr Gln Glu Pro Gln Lys Ser Leu Leu Gly
1 5 10

TCT CTC AAC TCC AAT GCC ACC TCT CAC CTT GGA CTG GCC ACC AAC CAG 98
Ser Leu Asn Ser Asn Ala Thr Ser His Leu Gly Leu Ala Thr Asn Gln
15 20 25

TCA GAG CCT TGG TGC CTG TAT GTG TCC ATC CCA GAT GGC CTC TTC CTC 146
Ser Glu Pro Trp Cys Leu Tyr Val Ser Ile Pro Asp Gly Leu Phe Leu
30 35 40

AGC CTA GGG CTG GTG AGT CTG GTG GAG AAT GTG CTG GTT GTG ATA GCC 194
Ser Leu Gly Leu Val Ser Leu Val Glu Asn Val Leu Val Val Ile Ala
45 50 55 60

ATC ACC AAA AAC CGC AAC CTG CAC TCG CCC ATG TAT TAC TTC ATC TGC 242
Ile Thr Lys Asn Arg Asn Leu His Ser Pro Met Tyr Tyr Phe Ile Cys
65 70 75

TGC CTG GCC CTG TCT GAC CTG ATG GTA AGT GTC AGC ATC GTG CTG GAG 290
Cys Leu Ala Leu Ser Asp Leu Met Val Ser Val Ser Ile Val Leu Glu
80 85 90

ACT ACT ATC ATC CTG CTG CTG GAG GTG GGC ATC CTG GTG GCC AGA GTG 338
Thr Thr Ile Ile Leu Leu Leu Glu Val Gly Ile Leu Val Ala Arg Val
95 100 105

GCT TTG GTG CAG CAG CTG GAC AAC CTC ATT GAC GTG CTC ATC TGT GGC 386
Ala Leu Val Gln Gln Leu Asp Asn Leu Ile Asp Val Leu Ile Cys Gly
110 115 120

TCC ATG GTG TCC AGT CTC TGC TTC CTG GGC ATC ATT GCT ATA GAC CGC 434
Ser Met Val Ser Ser Leu Cys Phe Leu Gly Ile Ile Ala Ile Asp Arg
125 130 135 140

TAC ATC TCC ATC TTC TAT GCG CTG CGT TAT CAC AGC ATC GTG ACG CTG 482
Tyr Ile Ser Ile Phe Tyr Ala Leu Arg Tyr His Ser Ile Val Thr Leu
145 150 155

CCC AGA GCA CGA CGG GCT GTC GTG GGC ATC TGG ATG GTC AGC ATC CTC 530
Pro Arg Ala Arg Arg Ala Val Val Gly Ile Trp Met Val Ser Ile Val
160 165 170

TCC AGC ACC CTC TTT ATC ACC TAC TAC AAG CAC ACA GCC GTT CTG CTC 578
Ser Ser Thr Leu Phe Ile Thr Tyr Tyr Lys His Thr Ala Val Leu Leu
175 180 185

TGC CTC GTC ACT TTC TTT CTA GCC ATG CTG GCA CTC ATG GCG ATT CTG 626
Cys Leu Val Thr Phe Phe Leu Ala Met Leu Ala Leu Met Ala Ile Leu
190 195 200

TAT GCC CAC ATG TTC ACG AGA GCG TGC CAG CAC GTC CAG GGC ATT GCC 674
Tyr Ala His Met Phe Thr Arg Ala Cys Gln His Val Gln Gly Ile Ala
205 210 215 220

CAG CTC CAC AAA AGG CGG CGG TCC ATC CGC CAA GGC TTC TGC CTC AAG 722
Gln Leu His Lys Arg Arg Arg Ser Ile Arg Gln Gly Phe Cys Leu Lys
225 230 235

GGT GCT GCC ACC CTT ACT ATC CTT CTG GGG ATT TTC TTC CTG TGC TGG 770
Gly Ala Ala Thr Leu Thr Ile Leu Leu Gly Ile Phe Phe Leu Cys Trp
240 245 250

GGC CCC TTC TTC CTG CAT CTC TTG CTC ATC GTC CTC TGC CCT CAG CAC 818
Gly Pro Phe Phe Leu His Leu Leu Leu Ile Val Leu Cys Pro Gln His
255 260 265

CCC ACC TGC AGC TGC ATC TTC AAG AAC TTC AAC CTC TTC CTC CTC CTC 866
Pro Thr Cys Ser Cys Ile Phe Lys Asn Phe Asn Leu Phe Leu Leu Leu
270 275 280

ATC GTC CTC AGC TCC ACT GTT GAC CCC CTC ATC TAT GCT TTC CGC AGC 914
Ile Val Leu Ser Ser Thr Val Asp Pro Leu Ile Tyr Ala Phe Arg Ser
285 290 295 300

CAG GAG CTC CGC ATG ACA CTC AAG GAG GTG CTG CTG TGC TCC TGG 959
Gln Glu Leu Arg Met Thr Leu Lys Glu Val Leu Leu Cys Ser Trp
305 310 315

TGATCAGAGG GCGCTGGGCA GAGGGTGACA GTGATATCCA GTGGCCTGCA TCTGTGAGAC 1019

CACAGGTACT CATCCCTTCC TGATCTCCAT TTGTCTAAGG GTCGACAGGA TGAGCTTTAA 1079

AATAGAAACC CAGAGTGCCT GGGGCCAGGA GAAAGGGTAA CTGTGACTGC AGGGCTCACC 1139

CAGGGCAGCT ACGGGAAGTG GAGGAGACAG GGATGGGAAC TCTAGCCCTG AGCAAGGGTC 1199

AGACCACAGG CTCCTGAAGA GCTTCACCTC TCCCCACCTA CAGGCAACTC CTGCTCAAGC 1259

C 1260
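
The feature coordinates recited for the cDNA sequences can be cross-checked arithmetically: the 5'UTR, CDS and 3'UTR should tile the full sequence length, and the CDS should comprise a whole number of codons. The sketch below applies this check to the coordinates given for SEQ ID NO: 3 and SEQ ID NO: 5; the function name is hypothetical.

```python
def check_features(total_length, utr5, cds, utr3):
    """Verify that three (start, end) features tile 1..total_length.

    Returns the number of codons in the CDS; raises ValueError on a mismatch.
    """
    if utr5[0] != 1 or utr5[1] + 1 != cds[0]:
        raise ValueError("5'UTR and CDS are not contiguous from position 1")
    if cds[1] + 1 != utr3[0] or utr3[1] != total_length:
        raise ValueError("CDS and 3'UTR do not run to the end of the sequence")
    cds_length = cds[1] - cds[0] + 1
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length // 3


# SEQ ID NO: 3: 945 bases of CDS, i.e. 315 codons (315 residues in SEQ ID NO: 4).
seq3_codons = check_features(1260, (1, 14), (15, 959), (960, 1260))
# SEQ ID NO: 5: 954 bases of CDS, i.e. 317 residues plus the stop codon.
seq5_codons = check_features(1633, (1, 461), (462, 1415), (1416, 1633))
print(seq3_codons, seq5_codons)
```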



(2) INFORMATION FOR SEQ ID NO: 4:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 315 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: protein

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Ser Thr Gln Glu Pro Gln Lys Ser Leu Leu Gly Ser Leu Asn Ser
1 5 10 15

Asn Ala Thr Ser His Leu Gly Leu Ala Thr Asn Gln Ser Glu Pro Trp
20 25 30

Cys Leu Tyr Val Ser Ile Pro Asp Gly Leu Phe Leu Ser Leu Gly Leu
35 40 45

Val Ser Leu Val Glu Asn Val Leu Val Val Ile Ala Ile Thr Lys Asn
50 55 60

Arg Asn Leu His Ser Pro Met Tyr Tyr Phe Ile Cys Cys Leu Ala Leu
65 70 75 80

Ser Asp Leu Met Val Ser Val Ser Ile Val Leu Glu Thr Thr Ile Ile
85 90 95

Leu Leu Leu Glu Val Gly Ile Leu Val Ala Arg Val Ala Leu Val Gln
100 105 110

Gln Leu Asp Asn Leu Ile Asp Val Leu Ile Cys Gly Ser Met Val Ser
115 120 125

Ser Leu Cys Phe Leu Gly Ile Ile Ala Ile Asp Arg Tyr Ile Ser Ile
130 135 140

Phe Tyr Ala Leu Arg Tyr His Ser Ile Val Thr Leu Pro Arg Ala Arg
145 150 155 160

Arg Ala Val Val Gly Ile Trp Met Val Ser Ile Val Ser Ser Thr Leu
165 170 175

Phe Ile Thr Tyr Tyr Lys His Thr Ala Val Leu Leu Cys Leu Val Thr
180 185 190

Phe Phe Leu Ala Met Leu Ala Leu Met Ala Ile Leu Tyr Ala His Met
195 200 205

Phe Thr Arg Ala Cys Gln His Val Gln Gly Ile Ala Gln Leu His Lys
210 215 220

Arg Arg Arg Ser Ile Arg Gln Gly Phe Cys Leu Lys Gly Ala Ala Thr
225 230 235 240

Leu Thr Ile Leu Leu Gly Ile Phe Phe Leu Cys Trp Gly Pro Phe Phe
245 250 255




Leu His Leu Leu Leu Ile Val Leu Cys Pro Gln His Pro Thr Cys Ser
260 265 270

Cys Ile Phe Lys Asn Phe Asn Leu Phe Leu Leu Leu Ile Val Leu Ser
275 280 285

Ser Thr Val Asp Pro Leu Ile Tyr Ala Phe Arg Ser Gln Glu Leu Arg
290 295 300

Met Thr Leu Lys Glu Val Leu Leu Cys Ser Trp
305 310 315



(2) INFORMATION FOR SEQ ID NO: 5:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 1633 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: cDNA to mRNA

    (ix) FEATURE:
        (A) NAME/KEY: 5'UTR
        (B) LOCATION: 1..461

    (ix) FEATURE:
        (A) NAME/KEY: CDS
        (B) LOCATION: 462..1415

    (ix) FEATURE:
        (A) NAME/KEY: 3'UTR
        (B) LOCATION: 1416..1633



    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

CCCGCATGTG GCCGCCCTCA ATGGAGGGCT CTGAGAACGA CTTTTAAAAC GCAGAGAAAA 60

AGCTCCATTC TTCCCAGACC TCAGCGCAGC CCTGGCCCAG GAAGGGAGGA GACAGAGGCC 120

AGGACGGTCC AGAGGTGTCG AAATGTCCTG GGAACCTGAG CAGCAGCCAC CAGGGAAGAG 180

GCAGGGAGGG AGCTGAGGAC CAGGCTTGGT TGTGAGAATC CCTGAGCCCA GGCGGTTGAT 240

GCCAGGAGGT GTCTGGACTG GCTGGGCCAT GCCTGGGCTG ACCTGTCCAG CCAGGGAGAG 300

GGTGTGAGGG CAGATCTGGG GGTGCCCAGA TGGAAGGAGG CAGGCATGGG GACACCCAAG 360

GCCCCCTGGC AGCACCATGA ACTAAGCAGG ACACCTGGAG GGGAAGAACT GTGGGGACCT 420

GGAGGCCTCC AACGACTCCT TCCTGCTTCC TGGACAGGAC T ATG GCT GTG CAG 473
Met Ala Val Gln
1

GGA TCC CAG AGA AGA CTT CTG GGC TCC CTC AAC TCC ACC CCC ACA GCC 521
Gly Ser Gln Arg Arg Leu Leu Gly Ser Leu Asn Ser Thr Pro Thr Ala
5 10 15 20

ATC CCC CAG CTG GGG CTG GCT GCC AAC CAG ACA GGA GCC CGG TGC CTG 569
Ile Pro Gln Leu Gly Leu Ala Ala Asn Gln Thr Gly Ala Arg Cys Leu
25 30 35

GAG GTG TCC ATC TCT GAC GGG CTC TTC CTC AGC CTG GGG CTG GTG AGC 617
Glu Val Ser Ile Ser Asp Gly Leu Phe Leu Ser Leu Gly Leu Val Ser
40 45 50

TTG GTG GAG AAC GCG CTG GTG GTG GCC ACC ATC GCC AAG AAC CGG AAC 665
Leu Val Glu Asn Ala Leu Val Val Ala Thr Ile Ala Lys Asn Arg Asn
55 60 65

CTG CAC TCA CCC ATG TAC TGC TTC ATC TGC TGC CTG GCC TTG TCG GAC 713
Leu His Ser Pro Met Tyr Cys Phe Ile Cys Cys Leu Ala Leu Ser Asp
70 75 80

CTG CTG GTG AGC GGG ACG AAC GTG CTG GAG ACG GCC GTC ATC CTC CTG 761
Leu Leu Val Ser Gly Thr Asn Val Leu Glu Thr Ala Val Ile Leu Leu
85 90 95 100

CTG GAG GCC GGT GCA CTG GTG GCC CGG GCT GCG GTG CTG CAG CAG CTG 809
Leu Glu Ala Gly Ala Leu Val Ala Arg Ala Ala Val Leu Gln Gln Leu
105 110 115

GAC AAT GTC ATT GAC GTG ATC ACC TGC AGC TCC ATG CTG TCC AGC CTC 857
Asp Asn Val Ile Asp Val Ile Thr Cys Ser Ser Met Leu Ser Ser Leu
120 125 130

TGC TTC CTG GGC GCC ATC GCC GTG GAC CGC TAC ATC TCC ATC TTC TAC 905
Cys Phe Leu Gly Ala Ile Ala Val Asp Arg Tyr Ile Ser Ile Phe Tyr
135 140 145

GCA CTG CGC TAC CAC AGC ATC GTG ACC CTG CCG CGG GCG CCG CGA GCC 953
Ala Leu Arg Tyr His Ser Ile Val Thr Leu Pro Arg Ala Pro Arg Ala
150 155 160

GTT GCG GCC ATC TGG GTG GCC AGT GTC GTC TTC AGC ACG CTC TTC ATC 1001
Val Ala Ala Ile Trp Val Ala Ser Val Val Phe Ser Thr Leu Phe Ile
165 170 175 180

GGC TAC TAC GAC CAC GTG GCC GTC CTG CTG TGC CTC GTG GTC TTC TTC 1049
Gly Tyr Tyr Asp His Val Ala Val Leu Leu Cys Leu Val Val Phe Phe
185 190 195

CTG GCT ATG CTG GTG CTC ATG GCC GTG CTG GAC GTC CAC ATG CTG GCC 1097
Leu Ala Met Leu Val Leu Met Ala Val Leu Asp Val His Met Leu Ala
200 205 210

CGG GCC TGC CAG CAC GCC CAG GGC ATC GCC CGG CTC CAC AAG AGG CAG 1145
Arg Ala Cys Gln His Ala Gln Gly Ile Ala Arg Leu His Lys Arg Gln
215 220 225

CGC CCG GTC CAC CAG GGC TTT GGC CTT AAA GGC GCT GTC ACC CTC ACC 1193
Arg Pro Val His Gln Gly Phe Gly Leu Lys Gly Ala Val Thr Leu Thr
230 235 240

ATC CTG CTG GGC ATT TTC TTC CTC TGC TGG GGC CCC TTC TTC CTG CAT 1241
Ile Leu Leu Gly Ile Phe Phe Leu Cys Trp Gly Pro Phe Phe Leu His
245 250 255 260

CTC ACA CTC ATC GTC CTC TGC CCC GAG CAC CCC ACG TGC GGC TGC ATC 1289
Leu Thr Leu Ile Val Leu Cys Pro Glu His Pro Thr Cys Gly Cys Ile
265 270 275

TTC AAG AAC TTC AAC CTC TTT CTC GCC CTC ATC ATC TGC AAT GCC ATC 1337
Phe Lys Asn Phe Asn Leu Phe Leu Ala Leu Ile Ile Cys Asn Ala Ile
280 285 290

ATC GAC CCC CTC ATC TAC GCC TTC CAC AGC CAG GAG CTC CGC AGG ACG 1385
Ile Asp Pro Leu Ile Tyr Ala Phe His Ser Gln Glu Leu Arg Arg Thr
295 300 305

CTC AAG GAG GTG CTG ACA TGC TCC TGG TGA GCGCGGTGCA CGCGCTTTAA 1435
Leu Lys Glu Val Leu Thr Cys Ser Trp *
310 315

GTGTGCTGGG CAGAGGGAGG TGGTGATATT GTGGTCTGGT TCCTGTGTGA CCCTGGGCAG 1495

TTCCTTACCT CCCTGGTCCC CGTTTGTCAA AGAGGATGGA CTAAATGATC TCTGAAAGTG 1555

TTGAAGCGCG GACCCTTCTG GGCAGGGAGG GGTCCTGCAA AACTCCAGGC AGGACTTCTC 1615

ACCAGCAGTC GTGGGAAC 1633

(2) INFORMATION FOR SEQ ID NO: 6:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 317 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: protein

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Ala Val Gln Gly Ser Gln Arg Arg Leu Leu Gly Ser Leu Asn Ser
1 5 10 15

Thr Pro Thr Ala Ile Pro Gln Leu Gly Leu Ala Ala Asn Gln Thr Gly
20 25 30

Ala Arg Cys Leu Glu Val Ser Ile Ser Asp Gly Leu Phe Leu Ser Leu
35 40 45

Gly Leu Val Ser Leu Val Glu Asn Ala Leu Val Val Ala Thr Ile Ala
50 55 60

Lys Asn Arg Asn Leu His Ser Pro Met Tyr Cys Phe Ile Cys Cys Leu
65 70 75 80

Ala Leu Ser Asp Leu Leu Val Ser Gly Thr Asn Val Leu Glu Thr Ala
85 90 95

Val Ile Leu Leu Leu Glu Ala Gly Ala Leu Val Ala Arg Ala Ala Val
100 105 110

Leu Gln Gln Leu Asp Asn Val Ile Asp Val Ile Thr Cys Ser Ser Met
115 120 125

Leu Ser Ser Leu Cys Phe Leu Gly Ala Ile Ala Val Asp Arg Tyr Ile
130 135 140

Ser Ile Phe Tyr Ala Leu Arg Tyr His Ser Ile Val Thr Leu Pro Arg
145 150 155 160

Ala Pro Arg Ala Val Ala Ala Ile Trp Val Ala Ser Val Val Phe Ser
165 170 175

Thr Leu Phe Ile Gly Tyr Tyr Asp His Val Ala Val Leu Leu Cys Leu
180 185 190

Val Val Phe Phe Leu Ala Met Leu Val Leu Met Ala Val Leu Asp Val
195 200 205

His Met Leu Ala Arg Ala Cys Gln His Ala Gln Gly Ile Ala Arg Leu
210 215 220

His Lys Arg Gln Arg Pro Val His Gln Gly Phe Gly Leu Lys Gly Ala
225 230 235 240

Val Thr Leu Thr Ile Leu Leu Gly Ile Phe Phe Leu Cys Trp Gly Pro
245 250 255

Phe Phe Leu His Leu Thr Leu Ile Val Leu Cys Pro Glu His Pro Thr
260 265 270

Cys Gly Cys Ile Phe Lys Asn Phe Asn Leu Phe Leu Ala Leu Ile Ile
275 280 285

Cys Asn Ala Ile Ile Asp Pro Leu Ile Tyr Ala Phe His Ser Gln Glu
290 295 300

Leu Arg Arg Thr Leu Lys Glu Val Leu Thr Cys Ser Trp *
305 310 315



(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2012 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(ix) FEATURE:

(A) NAME/KEY: 5'UTR

(B) LOCATION: 1..693

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 694..1587

(ix) FEATURE:

(A) NAME/KEY: 3'UTR

(B) LOCATION: 1588..2012

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

ACAACACTTT ATATATATTT TTATAAATGT AAGGGGTACA AAGGTGCCAT TTTGTTACAT 60

GGATATACCG TGTAGTGGTG AAGCCTGGGC TTTTAGTGTA TCTGTCATCA GAATAACATA 120

CGTGTTACCC ATAGGAATTT CTCATCACCC GCCCCCTCCA CCCTTCGAGT CTCCAATGTC 180

CATTCCACAC TCTA^TCCA_ CGTCT CATATAAGTG AGAACATGTA 240

GTATTTGACT TCCTCTTTCT GAGTTATTTC ACTTTGATAA TGGCCTCCAC TTCCATCCAT 300

GTTGCTGCAA AAGACATGAC CTTATTCTTT TTGATAGCTG GGGAGTACTC CATTGTGTAT 360

ATGTACCACA TTTCTTTATC CATTCACCCA TTGAGAACAC TTAGTTGATT CCATATCTTT 420
GCTATTGTCA CTAGTGCTGC AATAAACATA CATGTGCAGG CTCCTTCTAA TATACTGATT 480

TATATTTTAT GGAGAGAGAT AGAGTTCTTA GCGAGTGTGC TGTTTATTTC TAGTGTACTT 540

GCAACTAATA TTCTGTATAC TCCCTTTAGG TGATTGGAGA TTTAACTTAG ATCTCCAGCA 600 

AGTGCTACAA GAAGAAAAGA TCCTGAAGAA TCAATCAAGT TTCCGTGAAG TCAAGTCCAA 660 

GTAACATCCC CGCCTTAACC ACAAGCAGGA GAA ATG AAG CAC ATT ATC AAC TCG 714 

Met Lys His Ile Ile Asn Ser
1 5

TAT GAA AAC ATC AAC AAC ACA GCA AGA AAT AAT TCC GAC TGT CCT CGT 762
Tyr Glu Asn Ile Asn Asn Thr Ala Arg Asn Asn Ser Asp Cys Pro Arg
10 15 20

TGT GTT TTG CCG GAG GAG ATA TTT TTC ACA ATT TCC ATT GTT GGA GTT 810
Cys Val Leu Pro Glu Glu Ile Phe Phe Thr Ile Ser Ile Val Gly Val
25 30 35

TTG GAG AAT CTG ATC GTC CTG CTG GCT GTG TTC AAG AAT AAG AAT CTC 858
Leu Glu Asn Leu Ile Val Leu Leu Ala Val Phe Lys Asn Lys Asn Leu
40 45 50 55

CAG GCA CCC ATG TAC TTT TTC ATC TGT AGC TTG GCC ATA TCT GAT ATG 906
Gln Ala Pro Met Tyr Phe Phe Ile Cys Ser Leu Ala Ile Ser Asp Met
60 65 70

CTG GGC AGC CTA TAT AAG ATC TTG GAA AAT ATC CTG ATC ATA TTG AGA 954
Leu Gly Ser Leu Tyr Lys Ile Leu Glu Asn Ile Leu Ile Ile Leu Arg
75 80 85

AAC ATG GGC ATA CTC AAG CCA CGT GGC AGT TTT GAA ACC ACA GCC CAT 1002
Asn Met Gly Ile Leu Lys Pro Arg Gly Ser Phe Glu Thr Thr Ala His
90 95 100

GAC ATC ATC GAC TCC CTG TTT CTG CTC TCC CGT CTT GGC TCC ATC TTC 1050
Asp Ile Ile Asp Ser Leu Phe Leu Leu Ser Arg Leu Gly Ser Ile Phe
105 110 115

GAC CTG CTC GTG ATT GCT GCG GAC CGC TAC ATC ACC ATC TTC CAC GCA 1098
Asp Leu Leu Val Ile Ala Ala Asp Arg Tyr Ile Thr Ile Phe His Ala
120 125 130 135

CTG CGG TAC CAC AGC ATC GTG ACC ATG CGC CGC ACT GTG GTG GTG CTT 1146
Leu Arg Tyr His Ser Ile Val Thr Met Arg Arg Thr Val Val Val Leu
140 145 150

ACG GTC ATC TGG ACG TTC TGC ACG GGG ACT GGC ATC ACC ATG GTG ATC 1194
Thr Val Ile Trp Thr Phe Cys Thr Gly Thr Gly Ile Thr Met Val Ile
155 160 165
TTC TCC CAT CAT GTG CCC CAC GTG ATC ACC TTC ACG TCG CTG TTC CCG 1242
Phe Ser His His Val Pro His Val Ile Thr Phe Thr Ser Leu Phe Pro
170 175 180

CTG ATG CTG GTC TTC ATC CTG TGC CTC TAT GTG CAC ATG TTC CTG CTG 1290
Leu Met Leu Val Phe Ile Leu Cys Leu Tyr Val His Met Phe Leu Leu
185 190 195

GCT CGA TGG CAC ACC AGG AAG ATC TCC ACC CTC CCC AGA GCC AAC ATG 1338
Ala Arg Trp His Thr Arg Lys Ile Ser Thr Leu Pro Arg Ala Asn Met
200 205 210 215

AAA GGG GCC ATG ACA CTG ACC ATC CTG CTC GGG GTC TTC ATC TTC TGC 1386
Lys Gly Ala Met Thr Leu Thr Ile Leu Leu Gly Val Phe Ile Phe Cys
220 225 230

TGG GCC CCC TTT GTG CTT CAT GTC CTC TTG ATG ACA TTC TGC CCA AGT 1434
Trp Ala Pro Phe Val Leu His Val Leu Leu Met Thr Phe Cys Pro Ser
235 240 245

AAC CCC TAC TGC GCC TGC TAC ATG TCT CTC TTC CAG GTG AAC GGC ATG 1482
Asn Pro Tyr Cys Ala Cys Tyr Met Ser Leu Phe Gln Val Asn Gly Met
250 255 260

TTG ATC ATG TGC AAT GCC GTC ATT GAC CCC TTC ATA TAT GCC TTC CGG 1530
Leu Ile Met Cys Asn Ala Val Ile Asp Pro Phe Ile Tyr Ala Phe Arg
265 270 275

AGC CCA GAG CTC AGG GAC GCA TTC AAA AAG ATG ATC TTC TGC AGC AGG 1578
Ser Pro Glu Leu Arg Asp Ala Phe Lys Lys Met Ile Phe Cys Ser Arg
280 285 290 295

TAC TGG TAG AATGGCTGAT CCCTGGTTTT AGAATCCATG GGAATAACGT 1627
Tyr Trp *



TGCCAAGTGC CAGAATAGTG TAACATTCCA ACAAATGCCA GTGCTCCTCA CTGGCCTTCC 1687

TTCCCTAATG GATGCAAGGA TGACCCACCA GCTAGTGTTT CTGAATACTA TGGCCAGGAA 1747

CAGTCTATTG TAGGGGCAAC TCTATTTGTG ACTGGACAGA TAAAACGTGT AGTAAAAGAA 1807

GGATAGAATA CAAAGTATTA GGTACAAAAG TAATTAGGTT TGCATTACTT ATGACAAATG 1867

CATTACTTTT GCACCAATCT AGTAAAACAG CAATAAAAAT TCAAGGGCTT TGGGCTAAGG 1927

CAAAGACTTG CTTOCCTGTG GACATTAACA AGCCAGTTCT GAGGCGGCCT TTCCAGGTGG 1987
AGGCCATTGC AGCCAATTTC AGAGT 2012
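
The CDS coordinates given in the feature table for SEQ ID NO: 7 can be reconciled with the length of the encoded protein by simple arithmetic. The following is a minimal sketch; the function name `cds_protein_length` and its `includes_stop` parameter are illustrative assumptions, reflecting that the CDS interval 694..1587 of this sequence runs through the TAG stop codon.

```python
# Illustrative arithmetic only; `cds_protein_length` is a hypothetical helper.
def cds_protein_length(start: int, end: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by a CDS with inclusive, 1-based coordinates."""
    n_bases = end - start + 1
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = n_bases // 3
    # Assumption: when the interval runs through the stop codon, that codon
    # contributes no residue to the protein.
    return codons - 1 if includes_stop else codons


# CDS location of SEQ ID NO: 7 as given in its feature table.
print(cds_protein_length(694, 1587))  # 297
```

The interval spans 894 bases, or 298 codons, the last of which is the stop codon, leaving 297 residues.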
(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 297 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Lys His Ile Ile Asn Ser Tyr Glu Asn Ile Asn Asn Thr Ala Arg
1 5 10 15

Asn Asn Ser Asp Cys Pro Arg Cys Val Leu Pro Glu Glu Ile Phe Phe
20 25 30

Thr Ile Ser Ile Val Gly Val Leu Glu Asn Leu Ile Val Leu Leu Ala
35 40 45

Val Phe Lys Asn Lys Asn Leu Gln Ala Pro Met Tyr Phe Phe Ile Cys
50 55 60

Ser Leu Ala Ile Ser Asp Met Leu Gly Ser Leu Tyr Lys Ile Leu Glu
65 70 75 80

Asn Ile Leu Ile Ile Leu Arg Asn Met Gly Ile Leu Lys Pro Arg Gly
85 90 95

Ser Phe Glu Thr Thr Ala His Asp Ile Ile Asp Ser Leu Phe Leu Leu
100 105 110

Ser Arg Leu Gly Ser Ile Phe Asp Leu Leu Val Ile Ala Ala Asp Arg
115 120 125

Tyr Ile Thr Ile Phe His Ala Leu Arg Tyr His Ser Ile Val Thr Met
130 135 140

Arg Arg Thr Val Val Val Leu Thr Val Ile Trp Thr Phe Cys Thr Gly
145 150 155 160

Thr Gly Ile Thr Met Val Ile Phe Ser His His Val Pro His Val Ile
165 170 175

Thr Phe Thr Ser Leu Phe Pro Leu Met Leu Val Phe Ile Leu Cys Leu
180 185 190

Tyr Val His Met Phe Leu Leu Ala Arg Trp His Thr Arg Lys Ile Ser
195 200 205

Thr Leu Pro Arg Ala Asn Met Lys Gly Ala Met Thr Leu Thr Ile Leu
210 215 220

Leu Gly Val Phe Ile Phe Cys Trp Ala Pro Phe Val Leu His Val Leu
225 230 235 240

Leu Met Thr Phe Cys Pro Ser Asn Pro Tyr Cys Ala Cys Tyr Met Ser
245 250 255

Leu Phe Gln Val Asn Gly Met Leu Ile Met Cys Asn Ala Val Ile Asp
260 265 270

Pro Phe Ile Tyr Ala Phe Arg Ser Pro Glu Leu Arg Asp Ala Phe Lys
275 280 285

Lys Met Ile Phe Cys Ser Arg Tyr Trp *
290 295



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1108 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(ix) FEATURE:

(A) NAME/KEY: 5'UTR

(B) LOCATION: 1..132

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 133..1026

(ix) FEATURE:

(A) NAME/KEY: 3'UTR

(B) LOCATION: 1027..1108

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

GGGGCCAGAA AGTTCCTGCT TCAGAGCAGA AGATCTTCAG CAAGAACTAC AAAGAAGAAA 60

AGATTCTGGA GAATCAATCA AGTTTCCTGT CAAGTTCCAG TAACGTTTCT GTCTTAACTG 120

CACACAGGAA AG ATG AAA CAC ATT CTC AAT CTG TAT GAA AAC CTC AAC 168
Met Lys His Ile Leu Asn Leu Tyr Glu Asn Leu Asn
1 5 10
AGT ACA GCA AGA AAT AAC TCA GAC TGT CCT GCT GTG ATT TTG CCA GAA 216
Ser Thr Ala Arg Asn Asn Ser Asp Cys Pro Ala Val Ile Leu Pro Glu
15 20 25

GAG ATA TTT TTC ACA GTA TCC ATT GTT GGG GTT TTG GAG AAC CTG ATG 264
Glu Ile Phe Phe Thr Val Ser Ile Val Gly Val Leu Glu Asn Leu Met
30 35 40

GTC CTT CTG GCT GTG GCC AAG AAT AAG ATG CTT CAG TCG CCC ATG TAC 312
Val Leu Leu Ala Val Ala Lys Asn Lys Met Leu Gln Ser Pro Met Tyr
45 50 55 60

TTT TTC ATC TGC AGC TTG GCT ATT TCC GAT ATG CTG GGG AGC ATG TAC 360
Phe Phe Ile Cys Ser Leu Ala Ile Ser Asp Met Leu Gly Ser Met Tyr
65 70 75

AAG ATT TTG GAA AAC GTT CTG ATC ATG TTC AAA AAC ATG GGT TAC CTC 408
Lys Ile Leu Glu Asn Val Leu Ile Met Phe Lys Asn Met Gly Tyr Leu
80 85 90

GAG CCT CGA GGC AGT TTT GAA AGC ACA GCA GAT GAT GTG GTG GAC TCC 456
Glu Pro Arg Gly Ser Phe Glu Ser Thr Ala Asp Asp Val Val Asp Ser
95 100 105

CTG TTC ATC CTC TCC CTT CTC GGC TCC ATC TGC AGC CTG TCT GTG ATT 504
Leu Phe Ile Leu Ser Leu Leu Gly Ser Ile Cys Ser Leu Ser Val Ile
110 115 120

GCC GCT GAC CGC TAC ACT ACA ATC TTC CAC GCT CTG CAG TAC CAC CGC 552
Ala Ala Asp Arg Tyr Thr Thr Ile Phe His Ala Leu Gln Tyr His Arg
125 130 135 140

ATC ATG ACC CCC GCA CCG TGC CCT CGT CAT CTG ACG GTC CTC TGG CGA 600
Ile Met Thr Pro Ala Pro Cys Pro Arg His Leu Thr Val Leu Trp Arg
145 150 155

GGC TGC ACA GGC AGT GGC ATT ACC ATC GTG ACC TTC TCC CAT CAC GTC 648
Gly Cys Thr Gly Ser Gly Ile Thr Ile Val Thr Phe Ser His His Val
160 165 170

CCC ACA GTG ATC GCC TTC ACA GCG CTG TTC CCG CTG ATG CTG GCC TTC 696
Pro Thr Val Ile Ala Phe Thr Ala Leu Phe Pro Leu Met Leu Ala Phe
175 180 185

ATC CTG TGC CTC TAC GTG CAC ATG TTC CTG CTG GCC CGC TCC CAC ACC 744
Ile Leu Cys Leu Tyr Val His Met Phe Leu Leu Ala Arg Ser His Thr
190 195 200

AGG AGG ACC CCC TCC CTT CCC AAA GCC AAC ATG AGA GGG GCC GTC ACA 792
Arg Arg Thr Pro Ser Leu Pro Lys Ala Asn Met Arg Gly Ala Val Thr
205 210 215 220

CTG ACT GTC CTG CTC GGG GTC TTC ATT TTC TGT TGG GCA CCC TTT GTC 840
Leu Thr Val Leu Leu Gly Val Phe Ile Phe Cys Trp Ala Pro Phe Val
225 230 235

CTT CAT GTC CTC TTG ATG ACA TTC TGC CCA GCT GAC CCC TAC TGT GCC 888
Leu His Val Leu Leu Met Thr Phe Cys Pro Ala Asp Pro Tyr Cys Ala
240 245 250

TGC TAC ATG TCC CTC TTC CAG GTG AAT GGT GTG TTG ATC ATG TGT AAT 936
Cys Tyr Met Ser Leu Phe Gln Val Asn Gly Val Leu Ile Met Cys Asn
255 260 265

GCC ATC ATC GAC CCC TTC ATA TAT GCC TTT CGG AGC CCA GAG CTC AGG 984
Ala Ile Ile Asp Pro Phe Ile Tyr Ala Phe Arg Ser Pro Glu Leu Arg
270 275 280

GTC GCA TTC AAA AAG ATG GTT ATC TGC AAC TGT TAC CAG TAG 1026
Val Ala Phe Lys Lys Met Val Ile Cys Asn Cys Tyr Gln *
285 290 295

AATGATTGGT CCCTGATTTT AGGAGCCACA GGGATATACT GTCAGGGACA GAGTAGCGTG 1086

ACAGACCAAC AACACTAGGA CT 1108



(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 297 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Met Lys His Ile Leu Asn Leu Tyr Glu Asn Leu Asn Ser Thr Ala Arg
1 5 10 15

Asn Asn Ser Asp Cys Pro Ala Val Ile Leu Pro Glu Glu Ile Phe Phe
20 25 30

Thr Val Ser Ile Val Gly Val Leu Glu Asn Leu Met Val Leu Leu Ala
35 40 45

Val Ala Lys Asn Lys Met Leu Gln Ser Pro Met Tyr Phe Phe Ile Cys
50 55 60

Ser Leu Ala Ile Ser Asp Met Leu Gly Ser Met Tyr Lys Ile Leu Glu
65 70 75 80

Asn Val Leu Ile Met Phe Lys Asn Met Gly Tyr Leu Glu Pro Arg Gly
85 90 95

Ser Phe Glu Ser Thr Ala Asp Asp Val Val Asp Ser Leu Phe Ile Leu
100 105 110

Ser Leu Leu Gly Ser Ile Cys Ser Leu Ser Val Ile Ala Ala Asp Arg
115 120 125

Tyr Thr Thr Ile Phe His Ala Leu Gln Tyr His Arg Ile Met Thr Pro
130 135 140

Ala Pro Cys Pro Arg His Leu Thr Val Leu Trp Arg Gly Cys Thr Gly
145 150 155 160

Ser Gly Ile Thr Ile Val Thr Phe Ser His His Val Pro Thr Val Ile
165 170 175

Ala Phe Thr Ala Leu Phe Pro Leu Met Leu Ala Phe Ile Leu Cys Leu
180 185 190

Tyr Val His Met Phe Leu Leu Ala Arg Ser His Thr Arg Arg Thr Pro
195 200 205

Ser Leu Pro Lys Ala Asn Met Arg Gly Ala Val Thr Leu Thr Val Leu
210 215 220

Leu Gly Val Phe Ile Phe Cys Trp Ala Pro Phe Val Leu His Val Leu
225 230 235 240

Leu Met Thr Phe Cys Pro Ala Asp Pro Tyr Cys Ala Cys Tyr Met Ser
245 250 255

Leu Phe Gln Val Asn Gly Val Leu Ile Met Cys Asn Ala Ile Ile Asp
260 265 270

Pro Phe Ile Tyr Ala Phe Arg Ser Pro Glu Leu Arg Val Ala Phe Lys
275 280 285

Lys Met Val Ile Cys Asn Cys Tyr Gln *
290 295



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1338 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(ix) FEATURE:

(A) NAME/KEY: 5'UTR

(B) LOCATION: 1..297

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 298..1269

(ix) FEATURE:

(A) NAME/KEY: 3'UTR

(B) LOCATION: 1270..1338



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GGCTGTAACT GTAGCAACCG GTGTTGGGTG GGGATGAGAA GAGACCAGAG AGAGAGAGGG 60

TCAGAGCGAC AGGGGATGAG ACAGGCTGGT CAGAGTCTGC ACTGATTGTT GGAGACGCAA 120

AGGAAAGTTT TTTCTATGTC TCCAACCTCC CCCTCCTCCC CCGTTTCTCT CTGGAGAAAC 180

TAAAATGTAG ACTGGACAGC ATCCACAAGA GAAGCACCTA GAAGAAGATT TTTTTTTCCC 240

AGCAGCTTGC TCAGGACCCT GCAGGAGCTG CAGCCGGAAC TGGTCCCGCC GATAACC 297

ATG AAC TCT TCC TGC TGC CCG TCC TCC TCT TAT CCG ACG CTG CCT AAC 345
Met Asn Ser Ser Cys Cys Pro Ser Ser Ser Tyr Pro Thr Leu Pro Asn
1 5 10 15

CTC TCC CAG CAC CCT GCA GCC CCC TCT GCC AGC AAC CGG AGT GGC AGT 393
Leu Ser Gln His Pro Ala Ala Pro Ser Ala Ser Asn Arg Ser Gly Ser
20 25 30

GGG TTC TGC GAG CAG GTT TTC ATC AAG CCA GAG GTC TTC CTG GCA CTG 441
Gly Phe Cys Glu Gln Val Phe Ile Lys Pro Glu Val Phe Leu Ala Leu
35 40 45

GGC ATC GTC AGT CTG ATG GAA AAC ATC CTG GTG ATC CTG GCT GTG GTG 489
Gly Ile Val Ser Leu Met Glu Asn Ile Leu Val Ile Leu Ala Val Val
50 55 60

AGG AAC GGC AAC CTG CAC TCC CCC ATG TAC TTC TTC CTG CTG AGC CTG 537
Arg Asn Gly Asn Leu His Ser Pro Met Tyr Phe Phe Leu Leu Ser Leu
65 70 75 80

CTG CAG GCC GAC CTG CTG GTG AGC CTG TCC AAC TCC CTG GAG ACC ATC 585
Leu Gln Ala Asp Leu Leu Val Ser Leu Ser Asn Ser Leu Glu Thr Ile
85 90 95

ATG ATC GTG GTT ATC AAC AGC GAC TCC CTG ACC TTG GAG GAC CAA TTC 633
Met Ile Val Val Ile Asn Ser Asp Ser Leu Thr Leu Glu Asp Gln Phe
100 105 110

ATC CAG CAC ATG GAC AAC ATC TTC GAC TCT ATG ATC TGC ATC TCC CTG 681
Ile Gln His Met Asp Asn Ile Phe Asp Ser Met Ile Cys Ile Ser Leu
115 120 125

GTG GCC TCC ATC TGC AAC CTC CTG GCC ATC GCC GTG GAC AGG TAC GTC 729
Val Ala Ser Ile Cys Asn Leu Leu Ala Ile Ala Val Asp Arg Tyr Val
130 135 140

ACC ATC TTC TAT GCC CTC CGT TAC CAC AGC ATC ATG ACG GTT AGG AAA 777
Thr Ile Phe Tyr Ala Leu Arg Tyr His Ser Ile Met Thr Val Arg Lys
145 150 155 160

GCC CTC TCC TTG ATC GTG GCC ATC TGG GTC TGC TGT GGC ATC TGC GGC 825
Ala Leu Ser Leu Ile Val Ala Ile Trp Val Cys Cys Gly Ile Cys Gly
165 170 175

GTG ATG TTC ATC GTC TAC TCC GAG AGC AAG ATG GTC ATC GTG TGC CTC 873
Val Met Phe Ile Val Tyr Ser Glu Ser Lys Met Val Ile Val Cys Leu
180 185 190

ATC ACC ATG TTC TTC GCC ATG GTG CTC CTC ATG GGC ACC CTG TAC ATC 921
Ile Thr Met Phe Phe Ala Met Val Leu Leu Met Gly Thr Leu Tyr Ile
195 200 205

CAC ATG TTC CTC TTC GCC AGG CTG CAC GTC CAG CGC ATC GCG GCA CTG 969
His Met Phe Leu Phe Ala Arg Leu His Val Gln Arg Ile Ala Ala Leu
210 215 220

CCA CCT GCT GAC GGG CTA GCC CCG CAG CAG CAC TCG TGC ATG AAG GGG 1017
Pro Pro Ala Asp Gly Leu Ala Pro Gln Gln His Ser Cys Met Lys Gly
225 230 235 240

GCC GTC ACC ATC ACC ATC CTG CTG GGG GTT TTC ATC TTC TGC TGG GCG 1065
Ala Val Thr Ile Thr Ile Leu Leu Gly Val Phe Ile Phe Cys Trp Ala
245 250 255

CCT TTC TTC CTC CAC CTG GTC CTC ATC ATC ACC TGC CCC ACC AAC CCC 1113
Pro Phe Phe Leu His Leu Val Leu Ile Ile Thr Cys Pro Thr Asn Pro
260 265 270

TAC TGC ATC TGG TAG AGG GGG CAC TTG AAG AGG TAG GTG GTT CTC ATC 1161
Tyr Cys Ile Cys Tyr Thr Ala His Phe Asn Thr Tyr Leu Val Leu Ile
275 280 285

ATG TGC AAC TCT GTC ATC GAC CCC CTC ATC TAC GCC TTC CGC AGC CTG 1209
Met Cys Asn Ser Val Ile Asp Pro Leu Ile Tyr Ala Phe Arg Ser Leu
290 295 300

GAG CTG CGA AAC ACC TTC AAG GAG ATT CTC TGC GGT TGC AAT GGC ATG 1257
Glu Leu Arg Asn Thr Phe Lys Glu Ile Leu Cys Gly Cys Asn Gly Met
305 310 315 320

AAC GTG GGC TAG GAACCCCCGA GGAGGTGTTC CACGGCTAGC CAAGAGAGAA 1309
Asn Val Gly *

AAGCAATGCT CAGGTGAGAC ACAGAAGGG 1338



(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 323 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Met Asn Ser Ser Cys Cys Pro Ser Ser Ser Tyr Pro Thr Leu Pro Asn
1 5 10 15

Leu Ser Gln His Pro Ala Ala Pro Ser Ala Ser Asn Arg Ser Gly Ser
20 25 30

Gly Phe Cys Glu Gln Val Phe Ile Lys Pro Glu Val Phe Leu Ala Leu
35 40 45

Gly Ile Val Ser Leu Met Glu Asn Ile Leu Val Ile Leu Ala Val Val
50 55 60

Arg Asn Gly Asn Leu His Ser Pro Met Tyr Phe Phe Leu Leu Ser Leu
65 70 75 80

Leu Gln Ala Asp Leu Leu Val Ser Leu Ser Asn Ser Leu Glu Thr Ile
85 90 95

Met Ile Val Val Ile Asn Ser Asp Ser Leu Thr Leu Glu Asp Gln Phe
100 105 110

Ile Gln His Met Asp Asn Ile Phe Asp Ser Met Ile Cys Ile Ser Leu
115 120 125

Val Ala Ser Ile Cys Asn Leu Leu Ala Ile Ala Val Asp Arg Tyr Val
130 135 140

Thr Ile Phe Tyr Ala Leu Arg Tyr His Ser Ile Met Thr Val Arg Lys
145 150 155 160



Ala Leu Ser Leu Ile Val Ala Ile Trp Val Cys Cys Gly Ile Cys Gly
165 170 175

Val Met Phe Ile Val Tyr Ser Glu Ser Lys Met Val Ile Val Cys Leu
180 185 190

Ile Thr Met Phe Phe Ala Met Val Leu Leu Met Gly Thr Leu Tyr Ile
195 200 205

His Met Phe Leu Phe Ala Arg Leu His Val Gln Arg Ile Ala Ala Leu
210 215 220

Pro Pro Ala Asp Gly Leu Ala Pro Gln Gln His Ser Cys Met Lys Gly
225 230 235 240

Ala Val Thr Ile Thr Ile Leu Leu Gly Val Phe Ile Phe Cys Trp Ala
245 250 255

Pro Phe Phe Leu His Leu Val Leu Ile Ile Thr Cys Pro Thr Asn Pro
260 265 270

Tyr Cys Ile Cys Tyr Thr Ala His Phe Asn Thr Tyr Leu Val Leu Ile
275 280 285

Met Cys Asn Ser Val Ile Asp Pro Leu Ile Tyr Ala Phe Arg Ser Leu
290 295 300

Glu Leu Arg Asn Thr Phe Lys Glu Ile Leu Cys Gly Cys Asn Gly Met
305 310 315 320

Asn Val Gly *



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..30

(D) OTHER INFORMATION: /function = "Degenerate
oligonucleotide primer (sense)"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
GAGTCGACCR CCCATGTAYT DYTTCATCTG 30
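
A degenerate oligonucleotide such as the primer above stands for a pool of distinct sequences, one for each combination of bases permitted at the ambiguous positions. The following is a minimal sketch of that expansion, assuming the standard IUPAC nucleotide ambiguity codes and taking the primer exactly as printed; the names `degeneracy` and `expand` are illustrative only.

```python
# Illustrative expansion of a degenerate primer.
# Assumption: the letters R, Y and D are standard IUPAC ambiguity codes.
# `degeneracy` and `expand` are hypothetical helper names.
from itertools import product

IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def degeneracy(primer: str) -> int:
    """Number of distinct sequences represented by a degenerate primer."""
    count = 1
    for base in primer:
        count *= len(IUPAC[base])
    return count


def expand(primer: str) -> list:
    """List every concrete sequence represented by a degenerate primer."""
    return ["".join(bases) for bases in product(*(IUPAC[b] for b in primer))]


primer = "GAGTCGACCRCCCATGTAYTDYTTCATCTG"  # the primer above, spaces removed
print(len(primer), degeneracy(primer))  # 30 24
```

As printed, the primer contains one R, two Y and one D position, giving 2 x 2 x 3 x 2 = 24 member sequences.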



(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1671 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(ix) FEATURE:

(A) NAME/KEY: 5'UTR

(B) LOCATION: 1..393

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 394..1389

(ix) FEATURE:

(A) NAME/KEY: 3'UTR

(B) LOCATION: 1390..1671

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

AGCTTCCGAG AGGCAGCCGA TGTGAGCATG TGCGCACAGA TTCGTCTCCC AATGGCATGG 60

CAGCTTCAAG GAAAATTATT TTGAACAGAC TTGAATGCAT AAGATTAAAG TTAAAGCAGA 120

AGTGAGAACA AGAAAGCAAA GAGCAGACTC TTTCAACTGA GAATGAATAT TTTGAAGCCC 180

AAGATTTTAA CGTGATGATG ATTAGAGTCG TACCTAAAAG AGACTAAAAA CTCCATGTCA 240

AGCTCTGGAC TTGTGACATT TACTCACAGC AGGCATGGCA ATTTTAGCCT CACAACTTTC 300

AGACAGATAA AGACTTGGAG GAAATAACTG AGACGACTCC CTGACCCAGG AGGTTAAATC 360

AATTCAGGGG GACACTGGAA TTCTCCTGCC AGC ATG GTG AAC TCC ACC CAC CGT 414
Met Val Asn Ser Thr His Arg
1 5

GGG ATG CAC ACT TCT CTG CAC CTC TGG AAC CGC AGC AGT TAC AGA CTG 462
Gly Met His Thr Ser Leu His Leu Trp Asn Arg Ser Ser Tyr Arg Leu
10 15 20

CAC AGC AAT GCC AGT GAG TCC CTT GGA AAA GGC TAC TCT GAT GGA GGG 510
His Ser Asn Ala Ser Glu Ser Leu Gly Lys Gly Tyr Ser Asp Gly Gly
25 30 35

TGC TAC GCG CAA CTT TTT GTC TCT CCT GAG GTG TTT GTG ACT CTG GGT 558
Cys Tyr Ala Gln Leu Phe Val Ser Pro Glu Val Phe Val Thr Leu Gly
40 45 50 55

GTG ATC AGC TTG TTG GAG AAT ATC TTA GAG ATT GTG GCA ATA GCC AAG 606
Val Ile Ser Leu Leu Glu Asn Ile Leu Glu Ile Val Ala Ile Ala Lys
60 65 70

AAC AAG AAT CTG CAT TCA CCC ATG TAC TTT TTC ATC TGC AGC TTG GCT 654
Asn Lys Asn Leu His Ser Pro Met Tyr Phe Phe Ile Cys Ser Leu Ala
75 80 85

GTG GCT GAT ATG CTG GTG AGC GTT TCA AAT GGA TCA GAA ACC ATT ATC 702
Val Ala Asp Met Leu Val Ser Val Ser Asn Gly Ser Glu Thr Ile Ile
90 95 100

ATC ACC CTA TTA AAC CGT ACA GAT ACG GAT GCA CAG AGT TTC ACA GTG 750
Ile Thr Leu Leu Asn Arg Thr Asp Thr Asp Ala Gln Ser Phe Thr Val
105 110 115

AAT ATT GAT AAT GTC ATT GAC TCG GTG ATC TGT AGC TCC TTG CTT GCA 798
Asn Ile Asp Asn Val Ile Asp Ser Val Ile Cys Ser Ser Leu Leu Ala
120 125 130 135

TCC ATT TGC AGC CTG CTT TCA ATT GCA GTG GAC AGG TAC TTT ACT ATC 846
Ser Ile Cys Ser Leu Leu Ser Ile Ala Val Asp Arg Tyr Phe Thr Ile
140 145 150

TTC TAT GCT CTC CAG TAC CAT AAC ATT ATG ACA GTT AAG CGG GTT GGG 894
Phe Tyr Ala Leu Gln Tyr His Asn Ile Met Thr Val Lys Arg Val Gly
155 160 165

ATC AGC ATA AGT TGT ATC TGG GCA GCT TGC ACG GTT TCA GGT ATT TTG 942
Ile Ser Ile Ser Cys Ile Trp Ala Ala Cys Thr Val Ser Gly Ile Leu
170 175 180

TTC ATC ATT TAC TCA GAT AGT AGT GCT GTC ATC ATC TGC CTC ATC ACC 990
Phe Ile Ile Tyr Ser Asp Ser Ser Ala Val Ile Ile Cys Leu Ile Thr
185 190 195

ATG TTC TTC ACC ATG CTG GCT CTC ATG GCT TCT CTC TAT GTC CAC CTG 1038
Met Phe Phe Thr Met Leu Ala Leu Met Ala Ser Leu Tyr Val His Leu
200 205 210 215

j£!£g_„GTG ATG GGG--AGG-^TT— CAC~ATX~AAG— AGG-ATT— GCT- -GTC— CT-C-CCC— GGC- - 1 086- 

Phe Leu Met Ala Arg Leu His Ile Lys Arg Ile Ala Val Leu Pro Gly
220 225 230

ACT GGT GCC ATC CGC CAA GGT GCC AAT ATG AAG GGA GCG ATT ACC TTG 1134
Thr Gly Ala Ile Arg Gln Gly Ala Asn Met Lys Gly Ala Ile Thr Leu
235 240 245

ACC ATC CTG ATT GGC GTC TTT GTT GTC TGC TGG GCC CCA TTC TTC CTC 1182
Thr Ile Leu Ile Gly Val Phe Val Val Cys Trp Ala Pro Phe Phe Leu
250 255 260

CAC TTA ATA TTC TAC ATC TCT TGT CCT CAG AAT CCA TAT TGT GTG TGC 1230
His Leu Ile Phe Tyr Ile Ser Cys Pro Gln Asn Pro Tyr Cys Val Cys
265 270 275

TTC ATG TCT CAC TTT AAC TTG TAT CTC ATA CTG ATC ATG TGT AAT TCA 1278
Phe Met Ser His Phe Asn Leu Tyr Leu Ile Leu Ile Met Cys Asn Ser
280 285 290 295

ATC ATC GAT CCT CTG ATT TAT GCA CTC CGG AGT CAA GAA CTG AGG AAA 1326
Ile Ile Asp Pro Leu Ile Tyr Ala Leu Arg Ser Gln Glu Leu Arg Lys
300 305 310

ACC TTC AAA GAG ATC ATC TCT TCC TAT CCC CTG GGA GGC CTT TGT GAC 1374
Thr Phe Lys Glu Ile Ile Ser Ser Tyr Pro Leu Gly Gly Leu Cys Asp
315 320 325

TTG TCT AGC AGA TAT TAAATGGGGA CAGAGCACGC AATATAGGAA CATCCATAAG 1429
Leu Ser Ser Arg Tyr
330

AGACTTTTTC ACTCTTACCC TACCTGAATA TTCTACTTCT GCAACAGCTT TCTCTTCCGT 1489

GTAGGGTACT GGTTGAGATA TCCATTGTGT AAATTTAAGC CTATGATTTT TAATGAGAAA 1549

AAATGCCCAG TCTCTGTATT ATTTCCAATC TCATGCTACT TTTTTGGCCA TAAAATATGA 1609

ATCTATGTTA TAGGTTGTAG GCACTGTGGA TTTACAAAAA GAAAAGTCCT TATTAAAAGA 1669

TT 1671



(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 332 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Met Val Asn Ser Thr His Arg Gly Met His Thr Ser Leu His Leu Trp
1 5 10 15

Asn Arg Ser Ser Tyr Arg Leu His Ser Asn Ala Ser Glu Ser Leu Gly
20 25 30

Lys Gly Tyr Ser Asp Gly Gly Cys Tyr Ala Gln Leu Phe Val Ser Pro
35 40 45

Glu Val Phe Val Thr Leu Gly Val Ile Ser Leu Leu Glu Asn Ile Leu
50 55 60

Glu Ile Val Ala Ile Ala Lys Asn Lys Asn Leu His Ser Pro Met Tyr
65 70 75 80

Phe Phe Ile Cys Ser Leu Ala Val Ala Asp Met Leu Val Ser Val Ser
85 90 95

Asn Gly Ser Glu Thr Ile Ile Ile Thr Leu Leu Asn Arg Thr Asp Thr
100 105 110

Asp Ala Gln Ser Phe Thr Val Asn Ile Asp Asn Val Ile Asp Ser Val
115 120 125

Ile Cys Ser Ser Leu Leu Ala Ser Ile Cys Ser Leu Leu Ser Ile Ala
130 135 140

Val Asp Arg Tyr Phe Thr Ile Phe Tyr Ala Leu Gln Tyr His Asn Ile
145 150 155 160

Met Thr Val Lys Arg Val Gly Ile Ser Ile Ser Cys Ile Trp Ala Ala
165 170 175

Cys Thr Val Ser Gly Ile Leu Phe Ile Ile Tyr Ser Asp Ser Ser Ala
180 185 190

Val Ile Ile Cys Leu Ile Thr Met Phe Phe Thr Met Leu Ala Leu Met
195 200 205

Ala Ser Leu Tyr Val His Leu Phe Leu Met Ala Arg Leu His Ile Lys
210 215 220

Arg Ile Ala Val Leu Pro Gly Thr Gly Ala Ile Arg Gln Gly Ala Asn
225 230 235 240

Met Lys Gly Ala Ile Thr Leu Thr Ile Leu Ile Gly Val Phe Val Val
245 250 255

Cys Trp Ala Pro Phe Phe Leu His Leu Ile Phe Tyr Ile Ser Cys Pro
260 265 270

Gln Asn Pro Tyr Cys Val Cys Phe Met Ser His Phe Asn Leu Tyr Leu
275 280 285

Ile Leu Ile Met Cys Asn Ser Ile Ile Asp Pro Leu Ile Tyr Ala Leu
290 295 300

Arg Ser Gln Glu Leu Arg Lys Thr Phe Lys Glu Ile Ile Ser Ser Tyr
305 310 315 320

Pro Leu Gly Gly Leu Cys Asp Leu Ser Ser Arg Tyr
325 330



(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 978 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..975

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

ATG AAC TCC TCC TCC ACC CTG ACT GTA TTG AAT CTT ACC CTG AAC GCC 48
Met Asn Ser Ser Ser Thr Leu Thr Val Leu Asn Leu Thr Leu Asn Ala
1 5 10 15

TCA GAG GAT GGC ATT TTA GGA TCA AAT GTC AAG AAC AAG TCT TTG GCC 96
Ser Glu Asp Gly Ile Leu Gly Ser Asn Val Lys Asn Lys Ser Leu Ala
20 25 30

TGT GAA GAA ATG GGC ATT GCC GTG GAG GTG TTC CTG ACC CTG GGT CTC 144
Cys Glu Glu Met Gly Ile Ala Val Glu Val Phe Leu Thr Leu Gly Leu
35 40 45

GTC AGC CTC TTA GAG AAC ATC CTG GTC ATT GGG GCC ATA GTA AAG AAC 192
Val Ser Leu Leu Glu Asn Ile Leu Val Ile Gly Ala Ile Val Lys Asn
50 55 60

AAA AAC CTG CAC TCA CCC ATG TAC TTC TTT GTG GGC AGC TTA GCC GTG 240
Lys Asn Leu His Ser Pro Met Tyr Phe Phe Val Gly Ser Leu Ala Val
65 70 75 80

GGG GAC ATG CTG GTG AGC ATG TCC AAT GCC TGG GAG ACT GTC ACC ATA 288
Ala Asp Met Leu Val Ser Met Ser Asn Ala Trp Glu Thr Val Thr Ile
85 90 95

TAC TTG CTA AAT AAT AAA CAC CTG GTG ATA GCC GAC ACC TTT GTG CGA 336
Tyr Leu Leu Asn Asn Lys His Leu Val Ile Ala Asp Thr Phe Val Arg
100 105 110

CAC ATC GAC AAC GTG TTC GAC TCC ATG ATC TGC ATC TCT GTG GTG GCC 384
His Ile Asp Asn Val Phe Asp Ser Met Ile Cys Ile Ser Val Val Ala
115 120 125

TCG ATG TGC AGT TTG CTG GCC ATT GCG GTG GAT AGG TAC ATC ACC ATC 432
Ser Met Cys Ser Leu Leu Ala Ile Ala Val Asp Arg Tyr Ile Thr Ile
130 135 140

TTC TAT GCC TTG CGC TAC CAC CAC ATC ATG ACC GCG AGG CGC TCG GGG 480
Phe Tyr Ala Leu Arg Tyr His His Ile Met Thr Ala Arg Arg Ser Gly
145 150 155 160

GTG ATC ATC GCC TGC ATT TGG ACC TTC TGC ATA AGC TGC GGC ATT GTT 528
Val Ile Ile Ala Cys Ile Trp Thr Phe Cys Ile Ser Cys Gly Ile Val
165 170 175

TTC ATC ATC TAC TAT GAG TCC AAG TAT GTG ATC ATT TGC CTC ATC TCC 576
Phe Ile Ile Tyr Tyr Glu Ser Lys Tyr Val Ile Ile Cys Leu Ile Ser
180 185 190

ATG TTC TTC ACC ATG CTG TTC TTC ATG GTG TCT CTG TAT ATA CAC ATG 624
Met Phe Phe Thr Met Leu Phe Phe Met Val Ser Leu Tyr Ile His Met
195 200 205

TTC CTC CTG GCC CGG AAC CAT GTC AAG CGG ATA GCA GCT TCC CCC AGA 672
Phe Leu Leu Ala Arg Asn His Val Lys Arg Ile Ala Ala Ser Pro Arg
210 215 220

TAC AAC TCC GTG AGG CAA AGG ACC AGC ATG AAG GGG GCT ATT ACC CTC 720
Tyr Asn Ser Val Arg Gln Arg Thr Ser Met Lys Gly Ala Ile Thr Leu
225 230 235 240

ACC ATG CTA CTG GGG ATT TTC ATT GTC TGC TGG TCT CCC TTC TTT CTT 768
Thr Met Leu Leu Gly Ile Phe Ile Val Cys Trp Ser Pro Phe Phe Leu
245 250 255

CAC CTT ATC TTA ATG ATC TCC TGC CCT CAG AAC GTC TAC TGC TCT TGC 816
His Leu Ile Leu Met Ile Ser Cys Pro Gln Asn Val Tyr Cys Ser Cys
260 265 270

TTT ATG TCT TAC TTC AAC ATG TAC CTT ATA CTC ATC ATG TGC AAC TCC 864
Phe Met Ser Tyr Phe Asn Met Tyr Leu Ile Leu Ile Met Cys Asn Ser
275 280 285

GTG ATC GAT CCT CTC ATC TAC GCC CTC CGC AGC CAA GAG ATG CGG AGG 912
Val Ile Asp Pro Leu Ile Tyr Ala Leu Arg Ser Gln Glu Met Arg Arg
290 295 300

ACC TTT AAG GAG ATC GTC TGT TGT CAC GGA TTC CGG CGA CCT TGT AGG 960
Thr Phe Lys Glu Ile Val Cys Cys His Gly Phe Arg Arg Pro Cys Arg
305 310 315 320

CTC CTT GGC GGG TAT TAA 978
Leu Leu Gly Gly Tyr
325



(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 325 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Met Asn Ser Ser Ser Thr Leu Thr Val Leu Asn Leu Thr Leu Asn Ala
1 5 10 15

Ser Glu Asp Gly Ile Leu Gly Ser Asn Val Lys Asn Lys Ser Leu Ala
20 25 30

Cys Glu Glu Met Gly Ile Ala Val Glu Val Phe Leu Thr Leu Gly Leu
35 40 45

Val Ser Leu Leu Glu Asn Ile Leu Val Ile Gly Ala Ile Val Lys Asn
50 55 60

Lys Asn Leu His Ser Pro Met Tyr Phe Phe Val Gly Ser Leu Ala Val
65 70 75 80

Ala Asp Met Leu Val Ser Met Ser Asn Ala Trp Glu Thr Val Thr Ile
85 90 95

Tyr Leu Leu Asn Asn Lys His Leu Val Ile Ala Asp Thr Phe Val Arg
100 105 110

His Ile Asp Asn Val Phe Asp Ser Met Ile Cys Ile Ser Val Val Ala
115 120 125

Ser Met Cys Ser Leu Leu Ala Ile Ala Val Asp Arg Tyr Ile Thr Ile
130 135 140

Phe Tyr Ala Leu Arg Tyr His His Ile Met Thr Ala Arg Arg Ser Gly
145 150 155 160

Val Ile Ile Ala Cys Ile Trp Thr Phe Cys Ile Ser Cys Gly Ile Val
165 170 175

Phe Ile Ile Tyr Tyr Glu Ser Lys Tyr Val Ile Ile Cys Leu Ile Ser
180 185 190

Met Phe Phe Thr Met Leu Phe Phe Met Val Ser Leu Tyr Ile His Met
195 200 205

Phe Leu Leu Ala Arg Asn His Val Lys Arg Ile Ala Ala Ser Pro Arg
210 215 220

Tyr Asn Ser Val Arg Gln Arg Thr Ser Met Lys Gly Ala Ile Thr Leu
225 230 235 240

Thr Met Leu Leu Gly Ile Phe Ile Val Cys Trp Ser Pro Phe Phe Leu
245 250 255

His Leu Ile Leu Met Ile Ser Cys Pro Gln Asn Val Tyr Cys Ser Cys
260 265 270

Phe Met Ser Tyr Phe Asn Met Tyr Leu Ile Leu Ile Met Cys Asn Ser
275 280 285

Val Ile Asp Pro Leu Ile Tyr Ala Leu Arg Ser Gln Glu Met Arg Arg
290 295 300

Thr Phe Lys Glu Ile Val Cys Cys His Gly Phe Arg Arg Pro Cys Arg
305 310 315 320

Leu Leu Gly Gly Tyr
325



(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..32

(D) OTHER INFORMATION: /function = "Degenerate
oligonucleotide primer (antisense)"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
GAATTCGACG TCACAGTATG ACGGCCATGG
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WHAT WE CLAIM IS: 

1. A method for characterizing a compound as an agonist of a mammalian
melanocortin receptor, the method comprising the steps of:

(a) providing a panel comprising a first mammalian cell comprising a
recombinant expression construct encoding a mammalian melanocortin receptor that is
the α-MSH receptor, a second mammalian cell comprising a recombinant expression
construct encoding a mammalian melanocortin receptor that is the ACTH receptor, a
third mammalian cell comprising a recombinant expression construct encoding a
mammalian melanocortin receptor that is the MC-3 receptor, a fourth mammalian cell
comprising a recombinant expression construct encoding a mammalian melanocortin
receptor that is the MC-4 receptor, and a fifth mammalian cell comprising a recombinant
expression construct encoding a mammalian melanocortin receptor that is the MC-5
receptor, wherein each mammalian cell expresses the melanocortin receptor encoded by
the recombinant expression construct comprising the cell;

(b) contacting each of the cells of the panel with a test compound to be
characterized as an agonist of a mammalian melanocortin receptor; and

(c) detecting binding of the test compound to each of the mammalian
melanocortin receptors by assaying for a metabolite produced in the cells that bind the
compound.

2. The method of claim 1, wherein the metabolite detected in subpart (c) is
cyclic AMP.

3. The method of claim 1, each of the cells further comprising a
recombinant expression construct encoding a cyclic AMP responsive element (CRE)
transcription factor binding site operatively linked to a nucleic acid sequence encoding
a protein capable of producing a detectable metabolite.

4. The method of claim 3, wherein the nucleic acid sequence encodes
β-galactosidase.

5. The method of claim 3, wherein the recombinant expression construct is
pCRE/β-galactosidase.

6. The method of claim 3, wherein the detectable metabolite produced by
the protein encoded by the recombinant expression construct is produced by binding of
the test compound to the mammalian melanocortin receptor encoded by each of the cells
of the panel.

7. A method for characterizing a compound as an antagonist of a
mammalian melanocortin receptor, the method comprising the steps of:

(a) providing a panel comprising a first mammalian cell comprising a
recombinant expression construct encoding a mammalian melanocortin receptor that is
the α-MSH receptor, a second mammalian cell comprising a recombinant expression
construct encoding a mammalian melanocortin receptor that is the ACTH receptor, a
third mammalian cell comprising a recombinant expression construct encoding a
mammalian melanocortin receptor that is the MC-3 receptor, a fourth mammalian cell
comprising a recombinant expression construct encoding a mammalian melanocortin
receptor that is the MC-4 receptor, and a fifth mammalian cell comprising a recombinant
expression construct encoding a mammalian melanocortin receptor that is the MC-5
receptor, wherein each mammalian cell expresses the melanocortin receptor encoded by
the recombinant expression construct comprising the cell;

(b) contacting each of the cells of the panel with an agonist of the
mammalian melanocortin receptor in an amount sufficient to produce a detectable
amount of a metabolite produced in the cells that bind the agonist, in the presence or
absence of a test compound to be characterized as an antagonist of a mammalian
melanocortin receptor; and

(c) comparing the amount of the metabolite produced in each cell in the panel
in the presence of the test compound with the amount of the metabolite produced in each
cell in the panel in the absence of the test compound.

8. The method of claim 7, wherein the metabolite detected in subpart (c) is
cyclic AMP.

9. The method of claim 7, each of the cells further comprising a
recombinant expression construct encoding a cyclic AMP responsive element (CRE)
transcription factor binding site operatively linked to a nucleic acid sequence encoding
a protein capable of producing a detectable metabolite.

10. The method of claim 9, wherein the nucleic acid sequence encodes
β-galactosidase.

11. The method of claim 9, wherein the recombinant expression construct is
pCRE/β-galactosidase.

12. The method of claim 9, wherein the detectable metabolite produced by
the protein encoded by the recombinant expression construct is produced by binding of
the test compound to the mammalian melanocortin receptor encoded by each of the cells
of the panel.

13. The method of claim 1, wherein the test compound is an agonist of the
MC-3 mammalian melanocortin receptor.

14. The method of claim 1, wherein the test compound is an agonist of the
MC-4 mammalian melanocortin receptor.

15. The method of claim 3, wherein the test compound is an agonist of the
MC-3 mammalian melanocortin receptor.

16. The method of claim 3, wherein the test compound is an agonist of the
MC-4 mammalian melanocortin receptor.

17. The method of claim 7, wherein the test compound is an antagonist of the
MC-3 mammalian melanocortin receptor.

18. The method of claim 7, wherein the test compound is an antagonist of the
MC-4 mammalian melanocortin receptor.

19. The method of claim 9, wherein the test compound is an antagonist of the
MC-3 mammalian melanocortin receptor.

20. The method of claim 9, wherein the test compound is an antagonist of the
MC-4 mammalian melanocortin receptor.

21. A mammalian melanocortin MC-3 receptor agonist according to claim
13 or 15.

22. A mammalian melanocortin MC-4 receptor agonist according to claim
14 or 16.

23. A mammalian melanocortin MC-3 receptor antagonist according to
claim 17 or 19.

24. A mammalian melanocortin MC-4 receptor antagonist according to
claim 18 or 20.

25. A method of inhibiting feeding behavior in an animal, the method
comprising administering an effective amount of a mammalian melanocortin MC-3 or
MC-4 receptor agonist according to claim 21.

26. A method of stimulating feeding behavior in an animal, the method
comprising administering an effective amount of a mammalian melanocortin MC-3 or
MC-4 receptor antagonist according to claim 24.

27. A method for characterizing a mammalian melanocortin MC-3 or MC-4
receptor agonist as an inhibitor of feeding behavior in an animal, the method comprising:

(a) providing food to an animal that has been deprived of food for at least 12
hours, with or without administering to the animal a mammalian melanocortin MC-3 or
MC-4 receptor agonist according to claim 25; and

(b) comparing the amount of food eaten by the animal with and without
administration of the mammalian melanocortin MC-3 or MC-4 receptor agonist.

28. A method for characterizing a mammalian melanocortin MC-3 or MC-4
receptor antagonist as a stimulator of feeding behavior in an animal, the method
comprising:

(a) providing food to an animal that has not been otherwise deprived of food
for at least 12 hours, with or without administering to the animal a mammalian
melanocortin MC-3 or MC-4 receptor antagonist according to claim 26 immediately
prior to the onset of darkness or nighttime; and

(b) comparing the amount of food eaten by the animal with and without
administration of the mammalian melanocortin MC-3 or MC-4 receptor antagonist.

29. A mammalian melanocortin MC-3 or MC-4 receptor agonist having the
general formula:

A-B-C-D-E-F-G-amide

wherein A is Leu, Ile, Nle, Met, or substituted analogues thereof;
B is Asp, Glu, or substituted analogues thereof;
C is His or substituted analogues thereof;
D is D-Phe, D-Tyr, or substituted analogues thereof;
E is Arg, Lys, homoArg, homoLys, or substituted analogues thereof;
F is Trp or substituted analogues thereof;
G is Lys, homoLys, or substituted analogues thereof;
and wherein the peptide is cyclized by the formation of an amide bond between the side
chain carboxyl group of the Asp or Glu residue at position B in the peptide, and the side
chain amino group of the Lys or homoLys residue at position G.

30. A mammalian melanocortin MC-3 or MC-4 receptor antagonist having
the general formula:

A-B-C-D-E-F-G-amide

wherein A is Leu, Ile, Nle, Met, or substituted analogues thereof;
B is Asp, Glu, or substituted analogues thereof;
C is His or substituted analogues thereof;
D is D-Nal or substituted analogues thereof;
E is Arg, Lys, homoArg, homoLys, or substituted analogues thereof;
F is Trp or substituted analogues thereof;
G is Lys, homoLys, or substituted analogues thereof;
and wherein the peptide is cyclized by the formation of an amide bond between the side
chain carboxyl group of the Asp or Glu residue at position B in the peptide, and the side
chain amino group of the Lys or homoLys residue at position G.
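
The two general formulae differ only at position D, so a candidate heptapeptide can be sorted by a simple table lookup. The sketch below is illustrative only and rests on stated assumptions: the names `COMMON_POSITIONS`, `POSITION_D` and `classify` are hypothetical, residues are given as plain strings, "substituted analogues" are not modeled, and the cyclization between positions B and G is taken for granted rather than checked.

```python
# Residues allowed at the positions shared by both general formulae.
COMMON_POSITIONS = {
    "A": {"Leu", "Ile", "Nle", "Met"},
    "B": {"Asp", "Glu"},
    "C": {"His"},
    "E": {"Arg", "Lys", "homoArg", "homoLys"},
    "F": {"Trp"},
    "G": {"Lys", "homoLys"},
}

# Position D is the only position that distinguishes the two formulae.
POSITION_D = {
    "agonist": {"D-Phe", "D-Tyr"},
    "antagonist": {"D-Nal"},
}


def classify(residues):
    """Return 'agonist', 'antagonist', or None for a list of 7 residues (A..G)."""
    if len(residues) != 7:
        return None
    a, b, c, d, e, f, g = residues
    for position, residue in zip("ABCEFG", (a, b, c, e, f, g)):
        if residue not in COMMON_POSITIONS[position]:
            return None
    for label, allowed in POSITION_D.items():
        if d in allowed:
            return label
    return None


print(classify(["Nle", "Asp", "His", "D-Phe", "Arg", "Trp", "Lys"]))
print(classify(["Nle", "Asp", "His", "D-Nal", "Arg", "Trp", "Lys"]))
```

A match against the formula says nothing about measured activity; that still has to come from the receptor panel assays recited in the method claims.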

31. A biological screening panel for determining the receptor
agonist/antagonist profile of a test compound, the panel comprising a first mammalian
cell comprising a recombinant expression construct encoding a mammalian melanocortin
receptor that is the α-MSH receptor, a second mammalian cell comprising a
recombinant expression construct encoding a mammalian melanocortin receptor that is
the ACTH receptor, a third mammalian cell comprising a recombinant expression
construct encoding a mammalian melanocortin receptor that is the MC-3 receptor, a
fourth mammalian cell comprising a recombinant expression construct encoding a
mammalian melanocortin receptor that is the MC-4 receptor, and a fifth mammalian cell
comprising a recombinant expression construct encoding a mammalian melanocortin
receptor that is the MC-5 receptor, wherein each mammalian cell expresses the
melanocortin receptor encoded by the recombinant expression construct comprising the
cell.
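
Readouts from such a five-cell panel can be reduced to a per-receptor agonist/antagonist profile. The following is a minimal sketch under stated assumptions, not the procedure of the specification: the receptor labels, the function names, and the cutoffs (two-fold stimulation over basal, 50% inhibition of the agonist response) are all hypothetical choices for illustration. The readout may be any metabolite signal, for example cyclic AMP or a reporter enzyme activity.

```python
# Hypothetical labels for the five cell lines of the panel.
RECEPTORS = ("alpha-MSH-R", "ACTH-R", "MC-3", "MC-4", "MC-5")


def agonist_profile(basal, treated, min_fold=2.0):
    """Flag receptors whose signal rises at least min_fold over basal.

    basal / treated: dicts mapping receptor label -> metabolite signal
    without and with the test compound. min_fold is an assumed cutoff.
    """
    return {r: treated[r] / basal[r] >= min_fold for r in RECEPTORS}


def antagonist_profile(agonist_only, agonist_plus_test, min_inhibition=50.0):
    """Flag receptors where the test compound blunts a known agonist.

    Percent inhibition = 100 * (1 - signal with test / signal without).
    min_inhibition is an assumed cutoff, in percent.
    """
    profile = {}
    for r in RECEPTORS:
        inhibition = 100.0 * (1.0 - agonist_plus_test[r] / agonist_only[r])
        profile[r] = inhibition >= min_inhibition
    return profile


# Made-up numbers, for illustration only.
basal = {r: 10.0 for r in RECEPTORS}
treated = {**basal, "MC-3": 50.0, "MC-4": 40.0}
print(agonist_profile(basal, treated))
```

In practice the cutoffs would be replaced by dose-response fits and replicate statistics; the point here is only that each receptor in the panel is scored separately, which is what yields a profile rather than a single yes/no answer.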
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FIG. 1A 



TTCCTGACAA OACT ATG TCC ACT CAG GAG CCC CAG AAG AGT CTT CTG GGT 50
Met Ser Thr Gln Glu Pro Gln Lys Ser Leu Leu Gly
1 5 10

TCT CTC AAC TCC AAT GCC ACC TCT CAC CTT GGA CTG GCC ACC AAC CAG 98
Ser Leu Asn Ser Asn Ala Thr Ser His Leu Gly Leu Ala Thr Asn Gln
15 20 25

TCA GAG CCT TGG TGC CTG TAT GTG TCC ATC CCA GAT GGC CTC TTC CTC 146
Ser Glu Pro Trp Cys Leu Tyr Val Ser Ile Pro Asp Gly Leu Phe Leu
30 35 40

AGC CTA GGG CTG GTG AGT CTG GTG GAG AAT GTG CTG GTT GTG ATA GCC 194
Ser Leu Gly Leu Val Ser Leu Val Glu Asn Val Leu Val Val Ile Ala
45 50 55 60

ATC ACC AAA AAC CGC AAC CTG CAC TCG CCC ATG TAT TAC TTC ATC TGC 242
Ile Thr Lys Asn Arg Asn Leu His Ser Pro Met Tyr Tyr Phe Ile Cys
65 70 75

TGC CTG GCC CTG TCT GAC CTG ATG GTA AGT GTC AGC ATC GTG CTG GAG 290
Cys Leu Ala Leu Ser Asp Leu Met Val Ser Val Ser Ile Val Leu Glu
80 85 90

ACT ACT ATC ATC CTG CTG CTG GAG GTG GGC ATC CTG GTG GCC AGA GTG 338
Thr Thr Ile Ile Leu Leu Leu Glu Val Gly Ile Leu Val Ala Arg Val
95 100 105

GCT TTG GTG CAG CAG CTG GAC AAC CTC ATT GAC GTG CTC ATC TGT GGC 386
Ala Leu Val Gln Gln Leu Asp Asn Leu Ile Asp Val Leu Ile Cys Gly
110 115 120

TCC ATG GTG TCC AGT CTC TGC TTC CTG GGC ATC ATT GCT ATA GAC CGC 434
Ser Met Val Ser Ser Leu Cys Phe Leu Gly Ile Ile Ala Ile Asp Arg
125 130 135 140

-TAC ATC TCC ATC- TTC „TO AOC ATC OTO ACG CTO 482 

Tvr He Ser He Phe Tyr Ala Leu Arg Tyr His Ser lie Val thr Leu 
145 ISO 15S 

CCC AGA GCA CGA CGG GCT GTC GTG GGC ATC TGG ATG GTC AGC ATC GTC 530
Pro Arg Ala Arg Arg Ala Val Val Gly Ile Trp Met Val Ser Ile Val
160 165 170

FIG. 1B



TCC AOC ACC CTC TTT ATC ACC TAC TAC AAO CAC ACA OCC OTT CTO CTC 578 
Ser Ser Thr Leu Phe lie Thr Tyr Tyr Lye His Thr Ala Val Leu Leu 
175 160 185 

toc ctc arc act ttc ttt cta occ ato era oca ctc ato oco att era 626 
Cya Leu Val Thr Phe Phe Leu Ala Met Leu Ala Leu Met Ala lie Leu 
190 195 200 

TAT OCC CAC ATO TTC ACQ AOA OCO TOC CAO CAC OTC CAO OOC ATT OCC 674 
Tyr Ala Hie Met Phe Thr Arg Ala Cye Oln Hie Val Oln Oly lie Ala 
205 210 21S 220 

GAG CTC CAC AAA AGO COG COO TCC ATC COC CAA OOC TTC TOC CTC AAO 722 
Oln Leu Hie Lye Arg Arg Arg Ser lie Arg Oln Oly Phe Cye Leu Lye 
225 230 235 

GOT OCT OCC ACC CTT ACT ATC CTT CTO GOO ATT TTC TTC CTG TOC TOO 770 
Oly Ala Ala Thr Leu Thr He Leu Leu Oly lie Phe Phe Leu Cye Trp 
240 245 250 

OOC CCC TTC TTC CTO CAT CTC TTC CTC ATC OTC CTC TOC CCT CAO CAC 618 
Oly Pro Phe Phe Leu Hie Leu Leu Leu Xle Val Leu Cye Pro Gin Hie 
255 260 265 

CCC ACC TOC AOC TOC ATC TTC AAO AAC TTC AAC CTC TTC* CTC CTC CTC 866 
Pro Thr Cye Ser Cye Xle Phe Lye Asn Phe Aan Leu Phe Leu Leu Leu 
270 275 280 

ATC OTC CTC AOC TCC ACT OTT OAC CCC CTC ATC TAT OCT TTC COC AOC 914 
He Val Xjeu Ser Ser Thr Val Asp Pro Leu He Tyr Ala Phe Arg Ser 
285 290 295 300 

CAO OAO CTC COC ATO ACA CTC AAO GAG OTO CTO CTO TOC TCC TOO 959 
Oln Olu Leu Arg Met Thr Leu Lye Glu Val Leu Leu Cys Ser Trp 
305 310 315 



TOATCAOAOO GCGCTGOGCA GAGGGTGACA GTOATATCCA OTOOCCTOCA TCTOTGAOAC 1019 

CACAOOTACT CATCCCTTCC TOATCTCCAT TTOTCTAAOO OTCOACAOOA TOAOCTTTAA 1079 

AATAOAAACC CAOAOTOCCT OGGGCCAOGA OAAAOOOTAA CTOTGACTOC AOOQCTCACC 1139 

CAOOOCAGCT AC00OAAOTO GAOGAGACAO GOATGGOAAC TCTAGCCCTO AOCAAQGOTC 1199 

AOACCACAOG CTCCTGAAOA GCTTCACCTC TCCCCACCTA CAOOCAACTC CTOCTCAAOC 1259 

C 1260 






WO 98/10068 



PCT/US97/15565 



FIG. 2A



CCCGCATOTG OCCOCCCTCA AT0GA0OOCT CTGAGAACGA CTTTTAAAAC OCAGAOAAAA 60 

AOCTCCATTC TTCCCAGACC TCAOCOCAOC CCTOGCCCAG QAAOGOAGGA OACAOAOOCC 120 

AOOACOOTCC AOAOOTOTCa AAATGTCCTG GGAACCTGAG CAOCAGCCAC CAQOOAAOAO 180 

aCAQOOAGaa AaCTOAOOAC CAGGCTTGGT TGTGAGAATC CCTGAGCCCA OOCOaTTaAT 240 

OCCAOOAOOT OTCTOOACTG GCTOOGCCAT OCCTOOOCTO ACCTOTCCAG CCAGGGAOAG 300 

OOTOTOAOGO CAOATCTOOG GOTOCCCAGA TGGAAGGAGG CAOGCATGGQ GACACCCAAG 360 

OCCCCCTOOC AOCACCATGA ACTAAOCAOQ ACACCTGGAG OOOAAOAACT GTGGGGACCT 420 

GGAGGCCTCC AACOACTCCT TCCTOCTTCC TGGACAGGAC T AT0 GCT GTG CAG 473 

Met Ala Val Gin 

■ 1 

GGA TCC CAG AGA AGA CTT CTG GGC TCC CTC AAC TCC ACC CCC ACA GCC 521 
Gly Ser Gin Arg Arg Leu Leu Gly Ser Leu Acn Ser Thr Pro Thr Ala 
5 10 IS 20 

ATC CCC CAG CTG OGG CTG GCT GCC AAC CAG ACA GGA GCC CGG TOC CTG S69 
Zle Pro Gin Leu Gly Leu Ala Ala Asn Qln Thr Gly Ala Arg Cye Leu 
25 30 35 

GAG GTG TCC ATC TCT GAC GGG CTC TTC CTC AGC CTG GGG CTG GTG AOC 617 
Glu Val Ser He Ser Asp Gly Leu Phe Leu Ser Leu Gly Leu Val Ser 
40 45 SO 

TTG GTG GAG AAC GOG CTG GTG GTG GCC ACC ATC GCC AAG AAC CGG AAC 665 
Leu Val Glu Asn Ala Leu Val Val Ala Thr He Ala Lya Asn Arg Asn 
55 60 65 

CTG CAC TCA CCC ATG TAC TOC TTC ATC TOC TOC CTG GCC TTG TCG GAC 713 
Leu His Ser Pro Met Tyr Cye Phe He Cye Cye Leu Ala Leu Ser Asp 
70 75 80 

CTG CTG GTG AGC GGG ACQ AAC GTG CTG GAG ACQ GCC GTC ATC CTC CTG 761 
Leu Leu Val Ser Gly Thr Asn Val Leu Glu Thr Ala Val lie Leu Leu 
«5 90 95 10b 

CTG GAG GCC GOT GGA CTG GTG GCC CGG OCT GOG GTG CTG CAG CAG CTG 809 
Leu Glu Ala Gly Ala Leu Val Ala Arg Ala Ala Val Leu Gin Gin Leu 
105 HO 115 

GAC AAT GTC ATT GAC GTG ATC ACC TOC AOC TCC ATG CTG TCC AOC CTC 657 
Asp Asn Val He Asp Val He Thr Cye Ser Ser Met Leu Ser Ser Leu 

120* 125 130 

TOC TTC CTG GGC GCC ATC GCC GTG GAC CGC TAC ATC TCC ATC TTC TAC 905 
Cye Phe Leu Gly Ala. He Ala Val Asp Arg Tyr He Ser He Phe Tyr 
13S 140 145 

GCA CTG CGC TAC CAC AGC ATC GTG ACC CTG CCG CGG GCG CCG CGA GCC 953
Ala Leu Arg Tyr His Ser Ile Val Thr Leu Pro Arg Ala Pro Arg Ala
150 155 160

FIG. 2B

GTT GCG GCC ATC TGG GTG GCC AGT GTC GTC TTC AGC ACG CTC TTC ATC 1001
Val Ala Ala Ile Trp Val Ala Ser Val Val Phe Ser Thr Leu Phe Ile
165 170 175 180

GCC TAC TAC GAC CAC GTG GCC GTC CTG CTG TGC CTC GTG GTC TTC TTC 1049
Ala Tyr Tyr Asp His Val Ala Val Leu Leu Cys Leu Val Val Phe Phe
185 190 195

CTG GCT ATG CTG GTG CTC ATG GCC GTG CTG TAC GTC CAC ATG CTG GCC 1097
Leu Ala Met Leu Val Leu Met Ala Val Leu Tyr Val His Met Leu Ala
200 205 210

CGG GCC TGC CAG CAC GCC CAG GGC ATC GCC CGG CTC CAC AAG AGG CAG 1145
Arg Ala Cys Gln His Ala Gln Gly Ile Ala Arg Leu His Lys Arg Gln
215 220 225

CGC CCG GTC CAC CAG GGC TTT GGC CTT AAA GGC GCT GTC ACC CTC ACC 1193
Arg Pro Val His Gln Gly Phe Gly Leu Lys Gly Ala Val Thr Leu Thr
230 235 240

ATC CTC CTG GGC ATT TTC TTC CTC TGC TGG GGC CCC TTC TTC CTG CAT 1241
Ile Leu Leu Gly Ile Phe Phe Leu Cys Trp Gly Pro Phe Phe Leu His
245 250 255 260

CTC ACA CTC ATC GTC CTC TGC CCC GAG CAC CCC ACG TGC GGC TGC ATC 1289
Leu Thr Leu Ile Val Leu Cys Pro Glu His Pro Thr Cys Gly Cys Ile
265 270 275

TTC AAG AAC TTC AAC CTC TTT CTC GCC CTC ATC ATC TGC AAT GCC ATC 1337
Phe Lys Asn Phe Asn Leu Phe Leu Ala Leu Ile Ile Cys Asn Ala Ile
280 285 290

ATC GAC CCC CTC ATC TAC GCC TTC CAC AGC CAG GAG CTC CGC AGG ACG 1385
Ile Asp Pro Leu Ile Tyr Ala Phe His Ser Gln Glu Leu Arg Arg Thr
295 300 305

CTC AAG GAG GTG CTG ACA TGC TCC TGG TOAOCOCOOT GCACaCGCTT 1432
Leu Lys Glu Val Leu Thr Cys Ser Trp
310 315

TAAGTOTOCT OOGCAOAOOG AGOTGGTGAT ATTGTOGTCT GOTTCCTOTQ TQACCCTGOG 1492
CAOTTCCTTA CCTCCCTOOT CCCCGTTTGT CAAAGAOGAT GGACTAAATO ATCTCTOAAA 1552
GTOTTGAAGC OCOGACCCTT CTOGGCAGGG AOOGGTCCTG OUUACTCCA GOCAGGACTT 1612
CTCACCAOC* OTCGTGGGAA C 1633

FIG. 3A



ACAACACTTT ATATATATTT TTATAAATOT AAGGGGTACA AAGGTGCCAT TTTGTTACAT 60

OOATATACCO TQTAOTOOTO AAGCCTGGGC TTTTAOTOTA TCTQTCATCA OAATAACATA 120

COTOTTACCC ATAOOAATTT CTCATCACCC OCXTCCCTCCA CCCTTCOAOT CTCCAATGTC 180

CATTCCACAC TCTATATCCA COTGTATOCA TATAOCTCCA CATATAAGTQ AGAACATGTA 240

OTATTTOACT TCCTCTTTCT OAOTTATTTC ACTTTOATAA TGGCCTCCAC TTCCATCCAT 300

OTTOCTOCAA AAQACATGAC CTTATTCTTT TTQATAQCTO OGGAGTACTC CATTQTOTAT 360

ATOTACCACA TTTCTTTATC CATTCACCCA TTGAOAACAC TTAOTTOATT CCATATCTTT 420

OCTATTOTCA CTAOTGCTGC AATAAACATA CATOT0CAGG CTCCTTCTAA TATACTQATT 480

TATATTTTAT OGAGAGAGAT AOAOTTCTTA GCOAGTOTQC TOTTTATTTC TAOTOTACTT 540

OCAACTAATA TTCTGTATAC TCCCTTTAGG TGATTGGAGA TTTAACTTAG ATCTCCAOCA 600

AGTGCTACAA GAAGAAAAGA TCCTGAAGAA TCAATCAAGT TTCCGTGAAG TCAAGTCCAA 660

OTAACATCCC CQCCTTAACC ACAAOCAGOA OAA ATQ AAG CAC ATT ATC AAC TCO 714 

Met Lye His lie lie Asn Ser 
1 5 

TAT OAA AAC ATC AAC AAC ACA GCA AOA AAT AAT TCC OAC TOT CCT COT 762 
Tyr alu Asn lie Aan Asn Thr Ala Arg Asn Ann Ser Asp Cye Pro Arg 
10 is 20 

OTO GTT TTG CCO GAG GAG ATA TTT TTC ACA ATT TCC ATT GTT OGA GTT 810 
Val Val Leu Pro Glu Glu lie Phe Phe Thr He Ser He Val Gly Val 
25 30 35 

TTG GAG AAT CTG ATC GTC CTG CTQ GCT GTQ TTC AAG AAT AAG AAT CTC 856 
Leu Glu Aan Leu He Va.1 Leu Leu Ala Val Phe Lys Asn Lye Asn Leu 
*° *S 50 55 

CAO GCA CCC ATQ TAC TTT TTC ATC TOT AGC TTO OCC ATA TCT GAT ATG 906 
Gin Ala Pro Met Tyr Phe Phe lie Cye Ser Leu Ala lie" Ser Asp Met 
60 65 70 

CTG GGC AGC CTA TAT AAG ATC TTG OAA AAT ATC CTG ATC ATA TTG AOA 954 
Leu Gly Ser Leu Tyr Lys He Leu Glu Aan He Leu He He Leu Arg 
75 80 85 

AAC ATG GGC TAT CTC AAG CCA CGT GGC AGT TTT OAA ACC ACA GCC GAT 1002 
Aan Met Gly Tyr Leu Lys Pro Arg Gly Ser Phe Glu Thr Thr Ala Asp 
*0 95 100 

GAC ATC ATC GAC TCC CTG TTT GTC CTC TCC CTG CTT GGC TCC ATC TTC 1050
Asp Ile Ile Asp Ser Leu Phe Val Leu Ser Leu Leu Gly Ser Ile Phe
105 110 115

FIG. 3B



aoc era tct crro att oct oca gac coc tac atc acc atc ttc cac oca io98 
Ser Leu Ser Val He Ala Ala Asp Arg Tyr lie Thr He Phe Hie Ala 
"0 125 lio las 

CTO CGG TAC CAC AOC ATC OT0 ACC ATO COC CQC ACT OT0 OT0 GTO CTT 1146 
Leu Arg Tyr Hie Ser He Val Thr Met Arg Arg Thr Val Val Val Leu 
1«0 145 ISO 

ACO OTC ATC TOO ACQ TTC TOC ACQ GOO ACT OOC ATC ACC ATO OTO ATC 1194 
Thr Val He Trp Thr Phe Cye Thr Oly Thr Oly He Thr Met Val He 
155 160 165 

TTC TCC CAT CAT GTO CCC ACA OTO ATC ACC TTC ACO TCG CTG TTC CCG " 1242 

Phe Ser Hie Hie Val Pro Thr Val He Thr Phe Thr Ser Leu Phe Pro 
170 175 160 

CTG ATO CTO OTC TTC ATC CTO TOC CTC TAT GTG CAC ATO TTC CTO CTO 1290 
Leu Met Leu Val Phe He Leu Cye Leu Tyr Val Hie Met Phe Leu Leu 
185 190 195 

OCT COA TCC CAC ACC AGO AAO ATC TCC ACC CTC CCC AGA OCC AAC ATO 1338 
Ala Arg Ser His Thr Arg Lye He Ser Thr Leu Pro Arg Ala Aen Met 
200 205 210 215 

AAA GOG GCC ATC ACA CTG ACC ATC CTG CTC GGG GTC TTC ATC TTC TOC 1386 
Lye Oly Ala He Thr Leu Thr He Leu Leu Gly Val Phe He Phe Cye 
220 225 230 

TOG GCC CCC TTT GTG CTT CAT GTC CTC TTG ATG ACA TTC TOC CCA AGT 1434 
Trp Ala Pro Phe Val Leu His Val Leu Leu Met Thr Phe Cys Pro Ser 
235 240 245 

AAC CCC TAC TOC GCC TOC TAC ATG TCT CTC TTC GAG GTO AAC OOC ATG 1482 
Aen Pro Tyr Cys Ala Cye Tyr Met Ser Leu Phe Gin Val Aen Gly Met 
250 255 260 

TTG ATC ATO TOC AAT GCC GTC ATT OAC CCC TTC ATA TAT GCC TTC COG 
Leu^ He Met Cys Aen Ala Val lie Aap Pro Phe He T yr Ala Phr Arg 
265 270 275 



1530 



AOC CCA GAG CTC AGO OAC OCA TTC AAA AAO ATG ATC TTC TOC AGC AGO 1578 
Ser Pro Glu Leu Arg Asp Ala Phe Lye Lye Met lie Phe Cys Ser Arg 
250 285 290 295 

TAC TOO TAGAATOGCT GATCCCTOGT TTTAGAATCC ATGGGAATAA CGTTGCCAAO 1634 
Tyr Trp 

TGCCAGAATA GTOTAACATT CCAACAAATO CCAQTGCTCC TCACTGGCCT TCCTTCCCTA 1694 

ATGGATGCAA GOATGACCCA CCAOCTAOTG TTTCTGAATA CTATOGCCAG GAACAGTCTA 1754 

TTGTAOGGOC AACTCTATTT GTGACTGOAC AGATAAAACG TGTAGTAAAA GAAGGATAGA 1814 

ATAC AAAQTA TTAGGTACAA AAGTAATTAG GTTTGCATTA CTTATGACAA ATOCATTACT 1874 

TTTOCACCAA TGTAGTAAAA CAGCAATAAA AATTCAAGGG CTTTOGGCTA AOOCAAAGAC 1934 

TTOCTTTCCT GTOGACATTA ACAAOCCAGT TCTGAGOCGG CCTTTCCAGG TGOAGGCCAT 1994 
TGCAGCCAAT TTCAGAGT 2012

FIG. 4A



QOOOCCAOAA AGTTCCTGCT TCAOAGCAGA AOATCTTCAG CAAGAACTAC AAAGAAGAAA 60 

AGATTCTGOA GAATCAATCA AOTTTCCTGT CAAGTTCCAO TAACGTTTCT OTCTTAACTO 120 

CACACAOGAA AG ATO AAA CAC ATT CTC AAT CTG TAT QAA AAC ATC AAC 168 
Met Lye His He Leu Ann Leu Tyr Glu Aen He A en 
1 5 .10 

AOT ACA OCA AGA AAT AAC TCA GAC TOT CCT OCT GTG ATT TTG CCA OAA 216 
Ser Thr Ala Arg Aan Aan Ser Asp Cys Pro Ala Val He Leu Pro Glu 
15 20 25 

GAG ATA TTT TTC ACA GTA TCC ATT GTT GGG GTT TTG GAG AAC CTG ATG 264 
Glu He Phe Phe Thr Val Ser He Val Gly Val Leu Glu Aan Leu Met 
30 35 40 

GTC CTT CTO GCT GTG GCC AAG AAT AAG AGT CTT CAG TCG CCC ATG TAG 312 
Val Leu Leu Ala Val Ala Lys Aan Lye Ser Leu Gin Ser Pro Met Tyr 
45 50 55 60 

TTT TTC ATC TGC AOC TTG GCT ATT TCC GAT ATG CTG GGG AGC CTG TAC 360 
Phe Phe He Cye Ser Leu Ala He Ser Aep Met Leu Gly Ser Leu Tyr 
€5 70 75 

AAG ATT TTG GAA AAC GTT CTG ATC ATG TTC AAA AAC ATG GOT TAC. CTC 408 
Lye He Leu Glu Aan Val Leu He Met Phe Lya Asn Met Gly Tyr Leu 
80 85 90 

GAG CCT CGA GGC AGT TTT GAA AGC ACA GCA GAT GAT GTG GTG GAC TCC 456 
Glu Pro Arg Gly Ser Phe Glu Ser Thr Ala Aep Aep Val Val Asp Ser 
95 100 105 

CTG TTC ATC CTC TCC CTT CTC GGC TCC ATC TGC AOC CTG TCT GTG ATT 504 
Leu Phe He Leu Ser leu Leu Gly Ser He Cye Ser Leu Ser Val He 
110 115 120 

GCC GCT GAC CGC TAC ATC ACA ATC TTC CAC GCT CTG CAG TAC CAC CGC S52 
Ala Ala Asp Arg Tyr He Thr He Phe Hie Ala Leu Gin Tyr His Arg 
125 130 135 140 

ATC ATG ACC CCC GCA CCG TGC CCT CGT CAT CTG ACQ GTC CTC TGO GCA 600 
He Met Thr Pro Ala Pro Cye Pro Arg Hie Leu Thr Val Leu Trp Ala 
145 150 155 

GGC TGC ACA GGC AGT GGC ATT ACC ATC GTG ACC TTC TCC CAT CAC GTC 648 
Gly Cye Thr Gly Set Gly '"He "Thr lie VaT Thr Phe Ser Hie His Val 
160 165 170 

CCC ACA GTG ATC GCC TTC ACA GCG CTG TTC CCG CTG ATG CTG GCC TTC 696 
Pro Thr Val He Ala Phe Thr Ala Leu Phe Pro Leu Met Leu Ala Phe 
175 180 185 

ATC CTG TGC CTC TAC GTG CAC ATG TTC CTG CTG GCC CGC TCC CAC ACC 744 
He Leu Cye Leu Tyr Val His Met Phe Leu Leu Ala Arg Ser His Thr 
190 195 200 

AGG AGG ACC CCC TCC CTT CCC AAA GCC AAC ATG AGA GGG GCC GTC ACA 792
Arg Arg Thr Pro Ser Leu Pro Lys Ala Asn Met Arg Gly Ala Val Thr
205 210 215 220

FIG. 4B



era act gtc era ctc ogg arc ttc att ttc tot too oca ccc ttt otc e*o 

Lieu Thr Val Leu Leu Oly VaX Phe He Phe Cye Trp Ala Pro Phe Val 
225 230 235 

err cat otc ctc tto ato aca ttc toc cca oct gac ccc tac tot occ see 

LeuKis Val Leu Leu Met Thr Phe Cyo Pro Ala Aap Pro Tyr Cya Ala 
240 245 250 

TOC TAC ATO TCC CTC TTC CAG GTO AAT GOT OTO TTO ATC ATO TGT AAT 936 
Cye Tyr Met Ser Leu Phe Gin Val Aan Oly Val Leu He Met Cye Aan 
255 260 265 

GCC ATC ATC GAC CCC TTC ATA TATGCC TTT COO AOC CCA QAO CTC AGO SB< 
Ala lie lie Asp Pro Phe lie Tyr Ala Phe Arg Ser Pro Olu Leu Arg 
270 275 260 

OTC OCA TTC AAA AAO ATO OTT ATC TOC AAC TGT TAC CAG TAGAATGATT 1033 
Val Ala Phe Lys Lye Met Val tie Cya Asn Cys Tyr Oln 
285 290 295 

GOTCCCTGAT TTTAGOAGCC ACAGGGATAT ACTGTCAGGG ACAOAOTAGC OTGACAGACC 1093 
AACAACACTA GOACT 1108

FIG. 5A



GQCTGTAACT GTAOCAACCO GTGTTOOOTG OOGATGAGAA GAOACCAGAG AQAOAOAOOO €0 

TCAGAGCGAC AGGOGATGAG ACAOGCTOOT CAGAOTCTOC ACTGATTGTT GGAGACGCAA 120 

AGGAAAGTTT TTTCTATOTC TCCAACCTCC CCCTCCTCCC CCOTTTCTCT CTOOAOAAAC 

TAAAATCTAO ACTOGACAGC ATCCACAAOA OAAOCACCTA GAAGAAOATT TTTTTTTCCC 

AOCAOCTTQC TCAOOACCCT aCAQGAGCTG CAOCCOOAAC TGGTCCCGCC GATAACC 

ATG AAC TCT TCC TGC TOC CCO TCC TCC TCT TAT CCG ACQ CTG CCT AAC 
Met Asa Ser Ser Cys Cys Pro Ser Ser Ser Tyr Pro Thr Leu Pro Asn 
IS 10 ls 

CTC TCC CAO CAC CCT OCA OCC CCC TCT OCC AOC AAC COG AQT OOC ACT 
Leu Ser Gin His Pro Ala Ala Pro Ser Ala Ser Asn Arg Ser Gly Ser 
20 25 30 

QOG TTC TOC GAG CAO OTT TTC ATC AAO CCA GAG GTC TTC CTG OCA CTG 
Gly Phe Cye Glu Gin Val Phe He Lys Pro Olu Val Phe Leu Ala Leu 
35 40 4S 

OGC ATC GTC AOT CTG ATO OAA AAC ATC CTG OTG ATC CTG OCT GTG GTG 
Gly He Val Ser Leu Met Glu Asn He Leu Val He Leu Ala Val Val 
50 55 60 

AOG AAC OGC AAC CTG CAC TCC CCC ATG TAC TTC TTC CTG CTG AOC CTG 
Arg Asn Gly Asn Leu His Ser Pro Met Tyr Phe Phe Leu Leu Ser Leu 
« 70 7S 80 

CTG CAG OCC GAC ATG CTG GTG AOC CTG TCC AAC TCC CTG GAG ACC ATC 585 
Leu Gin Ala Asp Met Leu Val Ser Leu Ser Asn Ser Leu Glu Thr He 
85 90 95 

ATG ATC OTG OTT ATC AAC AGC GAC TCC CTG ACC TTG GAG GAC CAA TTC 633 
Met He Val Val He Asn Ser Asp Ser Leu Thr Leu Glu Asp Gin Phe 
100 105 1X0 ' 

ATC CAO CAC ATG GAC AAC ATC TTC GAC TCT ATG ATC TOC ATC TCC CTG 6B1 
He Gin His Met Asp Asn He Phe Asp Ser Met He Cys He Ser Leu 
115 120 125 

GTG OCC TCC ATC TOC AAC CTC CTG OCC ATC GCC GTG GAC AOG TAC GTC 729 
Val Ala Ser He Cys Asn Leu Leu Ala He Ala Val Asp Arg Tyr Val 
130 135 140 



160 
240 
297 
345 

393 

441 

469. 

S37 



ACC ATC TTC TAT GCC CTC COT TAC CAC AOC ATC ATG ACQ GTT AOG AAA 777 
Thr He Phe Tyr Ala Leu Arg Tyr His Ser He Met Thr Val Arg Lys 
145 "0 15S * X J 6Q 

GCC CTC TCC TTG ATC GTG GCC ATC TGG GTC TGC TGT GGC ATC TGC GGC 825
Ala Leu Ser Leu Ile Val Ala Ile Trp Val Cys Cys Gly Ile Cys Gly
165 170 175

GTG ATG TTC ATC GTC TAC TCC GAG AGC AAG ATG GTC ATC GTG TGC CTC 873
Val Met Phe Ile Val Tyr Ser Glu Ser Lys Met Val Ile Val Cys Leu
180 185 190

FIG. 5B



ATC ACC ATG TTC TTC GCC ATO GTG CTC CTC ATO OOC ACC CTO TAC ATC 921 
lie Thr Met Phe Phe Ala Met Val Leu Leu Met Gly Thr Leu Tyx He 
195 200 205 

CAC ATO TTC CTC TTC GCC AGO CTO CAC GTC CAO COC ATC GCG OCA CTO 969 
Hie Met Phe Leu Phe Ala Arg Leu Kia Val Gin Arg He Ala Ala Leu 
210 215 220 

CCA CCT OCT OAC GOG GTA GCC CCO CAO CAO CAC TCO TOC ATO AAO OGG 1017 
Pro Pro Ala Asp Gly Val Ala Pro Gin Gin Hie Ser Cys Met Lys Oly 
225 230 235 240 

GCC GTC ACC ATC ACC ATC CTO CTO GOG GTT TTC ATC TTC TOC TOO OCG 1065 
Ala Val Thr He Thr He Leu Leu Oly Val Phe He Phe Cys Trp Ala 
245 250 255 

CCT TTC TTC CTC CAC CTC GTC CTC ATC ATC ACC TOC CCC ACC AAC CCC 1113 
Pro Phe Phe Leu His Leu Val Leu He He Thr Cys Pro Thr Asn Pro 
260 265 270 

TAC TOC ATC TOC TAC ACQ GCG CAC TTC AAC ACC TAC CTG GTT CTC ATC 1161 
Tyr Cys He Cys Tyr Thr Ala Hie Phe Asa Thr Tyr Leu Val Leu He 
275 2B0 265 

ATO TOC AAC TCT GTC ATC OAC CCC CTC ATC TAC GCC CTC COC AOC CTO 1209 
Met Cys Asn Ser Val He Asp Pro Leu He Tyr Ala Phe Arg Ser Leu 
290 295 300 

QAO CTO COA AAC ACC TTC AAO OAO ATT CTC TOC GOT TOC AAT OOC ATO 1257 
Glu Leu Arg Asn Thr Phe Lys Glu He Leu Cys Gly Cys Asn- Gly Met 
305 310 315 320 

AAC GTO OOC TAGGAACCCC CGAGOAGGTG TTCCACOOCT AGCCAAGAOA 1306 
Asn Val Oly 

OAAAAQCAAT GCTCAOOTOA OACACAGAAG GO 1336

FIG. 6A



AOCTTCCGAG AOGCAGCCGA TGTOAOCATa TGCGCACAGA TTCOTCTCCC- AATOGCATOG 60 

CAGCTTCAAG GAAAATTATT TTGAACAGAC TTGAATOCAT AAGATTAAAG TTAAAGCAGA 12 0 

AGTGAGAACA AGAAAOCAAA GA0CAOACTC TTTCAACTGA GAATGAATAT TTTGAAGCCC 160 

AAGATTTTAA AGTOATOATO ATTAOAGTCO TACCTAAAAQ AOACTAAAAA CTCCATGTCA 24 0 

AOCTCTQQAC TTGTOACATT TACTCACAQC AOGCATOOCA ATTTTAOCCT CACAACTTTC 300 

AGAGAGATAA AGACTTGGAG GAAATAACTO AGACOACTCC CTGACCCAGG AGGTTAAATC 360 

AATTCAGG00 GACACTGGAA TTCTCCTGCC AOC ATG OTO AAC TCC ACC CAC COT 414 

Met Val Asn Ser Thr Hie Arg 

1 5 

OOO ATG CAC ACT TCT CTG CAC CTC TOO AAC COC AOC AGT TAC AOA CTG 462 
Gly Met Hie Thr Ser Leu His Leu Trp Asn Arg Ser Ser Tyr Arg Leu 
10 IS 20 

CAC AGC AAT GCC AGT OAG TCC CTT GGA AAA GOC TAC TCT GAT GGA GOG 510 
His Ser Asn Ala Ser Glu Ser Leu Gly Lys Gly Tyr Ser Asp Gly Gly 
25 \ 30 35 

TOC TAC GAG CAA CTT TTT GTC TCT CCT GAG GTG TTT GTO ACT CTG GOT S58 
Cye Tyr Glu Gin Leu Phe Val Ser Pro Glu Val Phe Val Thr Leu Gly 
40 45 50 55 

OTO ATC AGC TTG TTG GAG AAT ATC TTA GTG ATT GTG GGA ATA GCC AAG 606 
Val He Ser Leu Leu Glu Asn He Leu Val lie Val Ala He Ala Lye 
60 65 70 

AAC AAG AAT CTG CAT TCA CCC ATG TAC TTT TTC ATC TGC AGC TTG OCT 654 
Asn Lye Ann Leu Hie Ser Pro Met Tyr Phe Phe He Cye Ser Leu Ala 
75 80 85 

GTG OCT GAT ATG CTG GTG AGC GTT TCA AAT GGA TCA GAA ACC ATT ATC 702 
Val Ala Aap Met Leu Val Ser Val Ser Asn Gly Ser Glu Thr lie He 
90 95 100 

ATC ACC CTA TTA AAC AGT ACA OAT ACQ GAT GGA CAG AGT TTC ACA GTG 750 
He Thr Leu Leu Asn Ser Thr Asp Thr Asp Ala Gin Ser Phe Thr Val 
105 110 US 



AAT ATT GAT AAT GTC ATT OAC T«"flTO" XS TOT AOC "TCC TTG CTT GGA 798 
Asn lie Asp Asn Val He Asp Ser Val He Cys Ser Ser Leu Leu Ala 
120 125 130 13S 

TCC ATT TGC AGC CTG CTT TCA ATT OCA GTG GAC AGO TAC TTT ACT ATC 846 
Ser He Cye ser Leu Leu Ser He Ala Val Asp Arg Tyr Phe Thr He 
140 145 150 

TTC TAT OCT CTC CAG TAC CAT AAC ATT ATG ACA GTT AAG CGG GTT GGG 8 94 

Phe Tyr Ala Leu Gin Tyr His Asn He Met Thr Val Lye Arg Val Gly 
155 160 165 

ATC AGC ATA AGT TGT ATC TGG GCA GCT TGC ACG GTT TCA GGC ATT TTG 942
Ile Ser Ile Ser Cys Ile Trp Ala Ala Cys Thr Val Ser Gly Ile Leu
170 175 180

FIG. 6B

1374



TTG TCT AGC AGA TAT TAAATGGGGA CAGAGCACGC AATATAGOAA CATCCATAAG 1429 
Leu Ser Ser Arg Tyr 
330 

AGACTTTTTC ACTCTTACCC TACCTGAATA TTCTACTTCT GCAACAGCTT TCTCriXXXH' 1469 
GTAGGGTACT GGTTGAGATA TCCATTGTGT AAATTTAAGC CTATGATTTT TAATGAGAAA 154 9 



AAATGCCCAG TCTCTGTATT ATTTCCAATC TCATGCTACT TTTTTGGCCA TAAAATATGA 1609 
ATCTATGTTA TAGGTTGTAG GCACTGTGGA TTTACAAAAA GAAAAGTCCT TATTAAAAGC 1669 
" 1671 
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FIG. 7A 



ATG AAC TCC TCC TCC ACC CTG ACT OTA TTO AAT CTT ACC CTG t AAC OCC 46 
Met Asn Ser Ser Ser Thr Leu Thr Val Leu Asn Leu Thr Leu Asn Ala 
1 5 10 15 

TCA OAO OAT OOC ATT TTA OOA TCA AAT OTC AAO AAC AAG. TCT TTO OCC -96 
Ser Olu Asp Gly lie Leu Gly Ser Asn Val Lys Asn Lys Ser. Leu Ala 
20 25 30 

TGT GAA GAA ATG GOC ATT OCC GTO OAO OTO TTC CTO ACC CTG GOT CTC 144 
Cys Glu Olu Met Oly lie Ala Val Glu Val Phe Leu Thr Leu Gly Leu 
35 40 45 

OTC AOC CTC TTA OAO AAC ATC CTO GTC ATT GOG OCC ATA OTA AAO AAC 192 
Val Ser Leu Leu Glu Asn lie Leu Val lie Gly Ala lie Val Lys Asn 
50 55 60 

AAA AAC CTG CAC TCA CCC ATG TAC TTC TTT GTG GGC AGC TTA GCC GTG 240 
Lys Asn Leu His Ser Pro Met Tyr Phe Phe Val Gly Ser Leu Ala Val 
65 70 75 80 

GCC GAC ATG CTG GTG AGC ATG TCC AAT GCC TGG GAG ACT GTC ACC ATA 288 
Ala Asp Met Leu Val Ser Met Ser Asn Ala Trp Glu Thr Val Thr Ile 
85 90 95 

TAC TTG CTA AAT AAT AAA CAC CTG GTG ATA GCC GAC ACC TTT GTG CGA 336 
Tyr Leu Leu Asn Asn Lys His Leu Val Ile Ala Asp Thr Phe Val Arg 
100 105 110 

CAC ATC GAC AAC GTG TTC GAC TCC ATG ATC TGC ATC TCT GTG GTG GCC 384 
His Ile Asp Asn Val Phe Asp Ser Met Ile Cys Ile Ser Val Val Ala 
115 120 125 

TCG ATG TGC AGT TTG CTG GCC ATT GCG GTG GAT AGG TAC ATC ACC ATC 432 
Ser Met Cys Ser Leu Leu Ala Ile Ala Val Asp Arg Tyr Ile Thr Ile 
130 135 140 

TTC TAT GCC TTG CGC TAC CAC CAC ATC ATG ACC GCG AGG CGC TCG GGG 480 
Phe Tyr Ala Leu Arg Tyr His His Ile Met Thr Ala Arg Arg Ser Gly 
145 150 155 160 

GTG ATC ATC GCC TGC ATT TGG ACC TTC TGC ATA AGC TGC GGC ATT GTT 528 
Val Ile Ile Ala Cys Ile Trp Thr Phe Cys Ile Ser Cys Gly Ile Val 
165 170 175 

TTC ATC ATC TAC TAT GAG TCC AAG TAT GTG ATC ATT TGC CTC ATC TCC 576 
Phe Ile Ile Tyr Tyr Glu Ser Lys Tyr Val Ile Ile Cys Leu Ile Ser 
180 185 190 

ATG TTC TTC ACC ATG CTG TTC TTC ATG GTG TCT CTG TAT ATA CAC ATG 624 
Met Phe Phe Thr Met Leu Phe Phe Met Val Ser Leu Tyr Ile His Met 
195 200 205 

TTC CTC CTG GCC CGG AAC CAT GTC AAG CGG ATA GCA GCT TCC CCC AGA 672 
Phe Leu Leu Ala Arg Asn His Val Lys Arg Ile Ala Ala Ser Pro Arg 
210 215 220 

TAC AAC TCC GTG AGG CAA AGG ACC AGC ATG AAG GGG GCT ATT ACC CTC 720 
Tyr Asn Ser Val Arg Gln Arg Thr Ser Met Lys Gly Ala Ile Thr Leu 
225 230 235 240 
FIG. 7B 



ACC ATG CTA CTG GGG ATT TTC ATT GTC TGC TGG TCT CCC TTC TTT CTT 768 
Thr Met Leu Leu Gly Ile Phe Ile Val Cys Trp Ser Pro Phe Phe Leu 
245 250 255 

CAC CTT ATC TTA ATG ATC TCC TGC CCT CAG AAC GTC TAC TGC TCT TGC 816 
His Leu Ile Leu Met Ile Ser Cys Pro Gln Asn Val Tyr Cys Ser Cys 
260 265 270 



TTT ATG TCT TAC TTC AAC ATG TAC CTT ATA CTC ATC ATG TGC AAC TCC 864 
Phe Met Ser Tyr Phe Asn Met Tyr Leu Ile Leu Ile Met Cys Asn Ser 
275 280 285 

GTG ATC GAT CCT CTC ATC TAC GCC CTC CGC AGC CAA GAG ATG CGG AGG 912 
Val Ile Asp Pro Leu Ile Tyr Ala Leu Arg Ser Gln Glu Met Arg Arg 
290 295 300 

ACC TTT AAG GAG ATC GTC TGT TGT CAC GGA TTC CGG CGA CCT TGT AGG 960 
Thr Phe Lys Glu Ile Val Cys Cys His Gly Phe Arg Arg Pro Cys Arg 
305 310 315 320 

CTC CTT GGC GGG TAT TAA 
Leu Leu Gly Gly Tyr * 
325 
FIG. 14 